Vol. 61, No. 1 


Psychological Review 


THEODORE M. NEWCOMB, Editor 
UNIVERSITY OF MICHIGAN 


Lorraine Bouthilet, Managing Editor 





CONTENTS 


David Katz 1884-1953 Robert B. MacLeod 1
The Physiology of Motivation Eliot Stellar 5


The S-R Reinforcement Theory
of Extinction Henry Gleitman, Jack Nachmias,
& Ulric Neisser 23


Punishment: I. The Avoidance 
Hypothesis James A. Dinsmoor 34 


The Measurement of Values L. L. Thurstone 47


A Neural Model for 
Sign-Gestalt Theory James Olds 59


The Place of Physiological Constructs 
in a Genetic Explanatory System Gudmund Smith 73


A Note on Stimulus Intensity Dynamism (V) Frank A. Logan 77





PUBLISHED BIMONTHLY BY THE 
AMERICAN PSYCHOLOGICAL ASSOCIATION, INC.





CONSULTING EDITORS 


Solomon Asch          Robert B. MacLeod
Robert Blake          David C. McClelland
Stuart W. Cook        G. A. Miller
Clyde Coombs          Gardner Murphy
Leon Festinger        Oscar Oeser
W. R. Garner          Carroll C. Pratt
James J. Gibson       David Shakow
D. O. Hebb
Harry Helson          Richard Solomon
E. R. Hilgard         Eliot Stellar
Carl J. Hovland       S. S. Stevens
E. Lowell Kelly       Eric Trist
David Krech           Edward Walker
Robert W. Leeper      Robert White








The Psychological Review is devoted to theoretical articles of significance 
to any area of psychology. Except for occasional articles solicited by the 
Editor, manuscripts exceeding twelve printed pages (about 7,500 words) are 
not accepted. Ordinarily manuscripts which consist primarily of original
reports of research should be submitted to other journals.


Because of the large number of manuscripts submitted, there is an in- 
evitable publication lag of several months. Authors may avoid this delay if 
they are prepared to pay the costs of publishing their own articles; the
appearance of articles by other contributors is not thereby delayed.


Tables, footnotes, and references should appear on separate pages; all of 
these, as well as the text, should be typed double-spaced throughout, in all 
manuscripts submitted. Manuscripts should be addressed to the Editor, 
Dr. Theodore M. Newcomb, Doctoral Program in Social Psychology, Uni- 
versity of Michigan, Ann Arbor, Michigan. 


PUBLISHED BIMONTHLY BY THE 
AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
1333 SIXTEENTH ST. N. W., WASHINGTON 6, D. C. 
$6.50 volume $1.25 issue 


Entered as second-class matter July 13, 1897, at the post-office at Lancaster, Pa., under Act of Congress of 
March 3, 1879 


Acceptance for mailing at the special rate of postage provided for in paragraph (d-2), Section 34.40, 
P. L. & R. of 1948, authorized Jan. 8, 1948 


Copyright 1954 by the American Psychological Association, Inc. 
















Vol. 61, No. 1


JANUARY, 1954 


THE PSYCHOLOGICAL REVIEW 





DAVID KATZ 
1884-1953 


David Katz, Professor Emeritus at 
the University of Stockholm, died of 
a sudden heart attack on February 2, 
1953. By those who attended the In- 
ternational Congress of Psychology in 
Stockholm in July, 1951, he will be re- 
membered as the indefatigable organizer 
and genial host of the congress. In the 
history of psychology his name will be 
associated with significant contributions 
to almost every field of psychology, pure 
and applied; and he will be cited as one 
of this century’s outstanding exponents 
of psychological phenomenology. In 
the memories of those who knew him 
and loved him he will live as a gentle, 
humble man, persistently curious about 
everything that had to do with human 
nature, brilliant in his intuitions, tire- 
less in his research, unfailingly generous 
and courteous in controversy. 

Katz was born in Kassel, Germany, 
on October 1, 1884. His early education
was in Kassel, his university education
in Berlin, Munich, and Göttingen,
where he received his doctoral
degree in 1906. In Göttingen he was
one of G. E. Müller’s most brilliant pupils.
Later he became Müller’s assistant,
and, in 1911, Privat Dozent. During
World War I he was called to army
service for four years, returning afterwards
to his post in Göttingen. It was
during his Göttingen period that he
completed his now classic researches 
on the experimental phenomenology of 
color, and began his less well-known 
but equally significant work on touch. 


In 1919 he accepted the chair of psy- 
chology and education at the University 
of Rostock, where he developed what 
eventually became one of the most 
productive psychological laboratories in 
Europe. In 1933 the National Socialist
party came into power, and Katz,
as a non-Aryan, was deprived of his po- 
sition. Fortunately his British friends 
were willing to provide hospitality and, 
for the next four years, first in Man- 
chester and later in London, he was able 
to pursue his scientific work. In 1937 
he accepted the chair of education (in- 
cluding psychology) at the University 
of Stockholm, where he remained until 
his retirement in 1952. 

Katz paid two visits to the United 
States, in 1929 as Visiting Professor at 
the University of Maine and in 1950 as 
Hitchcock Lecturer at the University of 
California. 

During his period as G. E. Müller’s
assistant, Katz was fond of relating, an
attractive young Russian girl was admitted
as a student. In reply to Katz’s
query, Müller characterized her as “eine
Madonna mit einer Bombe.” Rosa
Heine did not blow up the Institute,
thereby failing to conform to Müller’s
stereotype of the Russian, but she
speedily conquered Müller’s assistant.
Katz and Rosa Heine were married in 
1919. Numerous joint publications at- 
test to their productivity as a scientific 
team. Their two sons, now launched 
on their own professional careers, were 
made prematurely famous by their 





ROBERT B. 


parents’ book, Gesprache mit Kindern 
(1927). 

To review Katz’s contributions to 
psychology would be a major under- 
taking. As a scientist he had “green 
fingers.” He had but to touch a prob- 
lem, and it readily blossomed and bore 
fruit. His list of publications includes 
more than 100 titles, of which at least 
20 are substantial books and mono- 
graphs. Among these one finds con- 
tributions to animal, child, educational, 
abnormal, and social psychology, to the 
experimental psychology of perception, 
motivation, learning, and thinking, to 
systematic theory, and to laboratory in- 
strumentation. It may be that he scat- 
tered his energies too widely; certainly, 
not all his researches are of equal merit. 
It was his genius, however, to find in 
the commonplace observations of daily 
life problems which, when viewed in a 
larger context, became significant, and 
to make psychological capital out of 
every new experience with which good 
or bad fortune provided him. Thus, 
his wartime assignment to a military 
hospital led to a pioneer study of the 
psychological problems of amputees, 
and later to the invention of a device 
for the training of students in the tech- 
nique of percussion; the feeding prob- 
lems of his children contributed to his 
interest in constitutional typology and 
in the theory of hunger and appetite; 
his own difficulty with the English and 
Swedish languages challenged him as a 
psychologist to do some experiments on 
problems of language and thinking. 

It was also his genius to find simple 
and inexpensive ways of attacking ma- 
jor problems. Katz belonged perforce 
to the cardboard and thumbtack school; 
but he never allowed a meager budget 
to hamper his activity. In Rostock he 
was faced with the task of developing a 
research institute on an annual budget 
of approximately $125. Some of his 
problems required the use of animals.
He could not afford a regular animal
laboratory; so he bought some chicks. 
Out of his chicken yard came the well- 
known Hackgesetz, the studies of chick- 
ens reared in isolation, the studies of 
“counting” behavior in chickens, and 
the experiments that led to the “avid- 
ity” theory of appetite. While in Eng- 
land, lacking an adequate laboratory, 
he pursued his tactual researches by 
undertaking some assignments for the 
flour millers, who were concerned about 
the elasticity of their dough. When he 
arrived in Sweden, he was assigned a 
small apartment as a laboratory. The 
kitchen promptly became a workshop, 
the bathroom became a photographic
darkroom, a fifteen-year-old boy served 
as technician, and with cardboard, 
thumbtacks, bathroom scales, and sticks 
of wood, the laboratory began to pro- 
duce research. When one thinks of 
David Katz, one wonders sometimes 
whether handsome budgets are a hin- 
drance or an aid to productivity. 

The frustrated graduate student in 
search of a doctoral problem has but 
to thumb through a few of Katz’s pub- 
lications to find a wealth of inviting 
questions and challenging hypotheses 
that will draw him straight to the lab- 
oratory. The human hand as a unitary 
sense organ analogous to the eye, the 
composite photograph as a device for 
the study of group characteristics, the 
sensory basis of the phenomenon of 
elasticity, the phantom limb of the am- 
putee, the ability of certain deaf people 
to appreciate music, and a host of other 
apparent byways of psychological in- 
vestigation were opened up by Katz 
and redirected towards the central prob- 
lem. It was characteristic of his rest- 
less curiosity, however, that he was fre- 
quently content to blaze the trail, be- 
queathing to another generation the task 
of exploiting it. 

The unity within Katz’s apparent diversity
of interest is to be found in his
consistent application of the phenomenological
method. He was interested
in the prediction and control of behav- 
ior, in the social and biological deter- 
minants of behavior, in the tricky prob- 
lems of instrumentation, in the broader 
problems of psychological theory, but 
behind it all was a persistent, a pas- 
sionate curiosity about the world of 
phenomena. For Katz the most fasci- 
nating thing to wonder about was a hu- 
man experience. It might be a simple 
color or sound, or the strange beauty of 
an El Greco picture, or the peculiar sensations
that accompany the crunching of
a nut between the teeth, or the ineffable 
satisfyingness of a cool draught of beer 
on a warm day. All experience was 
something to appreciate and to wonder 
about. For him the first task of the 
psychologist—not really a task, but a 
pleasure—was to observe and describe 
without bias both the salient characteristics
and the subtle nuances of ordinary
human experience. Phenomenology
for him was essentially an attitude
of “disciplined naiveté.” From descriptive
analysis one proceeds to experiment
and to theory, but no psychological the- 
ory, he argued, could be complete if it 
excluded any of the essential variables 
of human experience. 

Katz’s psychological phenomenology
is best exemplified in his studies of color
and touch, Die Erscheinungsweisen der
Farben (1911¹) and Der Aufbau der
Tastwelt (1925). Influenced by the
physiologist Hering and the philosopher
Husserl he insisted that the psychologist
should begin by deliberately
“bracketing” his physical, physiological,
and philosophical biases and attempt to
observe phenomena as they are actually
presented. The phenomenal world thus
viewed contains properties and relationships
that escape the notice of the physically
or physiologically oriented observer.
The classical psychologist was
content to order colors in terms of hue,
brightness, and saturation; Katz saw
them also varying in mode of appearance,
pronouncedness, insistence, transparency,
inherence, and stability. Classical
psychology was busily mapping the
patterns of pressure, pain, warm, and
cold spots on the skin, and searching for
receptors; Katz went further, and explored
the active process of “touching”
(tasten), discovering here, too, modes
of appearance, properties of organization,
and unsuspected kinds of sensitivity.
It is unfortunate that, while his
visual studies have been widely appreciated,
his richly suggestive book on the
world of touch has received relatively
little notice.

¹ Later revised as Der Aufbau der Farbwelt
(1930); abridged and translated into English
as The World of Colour (1935).

During recent years the word phe- 
nomenological has tended to expand its 
meaning. It is coming to suggest an 
easy-going, intuitive, sympathetic “seeing
the world as the other fellow sees
it,” an approach that permits one to
take things at their face value and to 
avoid the rigors of experimentation and 
theory construction. This is definitely 
not the kind of psychological phenom- 
enology that Katz advocated. True, he 
was interested in the “fuzzy” aspects of 
experience; but for him the “fuzziness” 
of a phenomenon was no excuse for care- 
less observation or undisciplined think- 
ing. Good phenomenology, he held, re- 
quires at least as much training and 
discipline as does good Titchenerian in- 
trospection. Nor does phenomenology 
lead away from experimentation and 
theory; it is an essential first step in 
the direction of more imaginative ex- 
perimentation and sounder theory. 

Katz adhered to no “school” of psy- 
chology, nor—which is strange in a 
German of his generation—did he ever 
attempt to found a school. In his sym- 
pathies he stood closest to the Gestalt 
theorists; indeed, his pioneer work on
phenomenal constancy must be regarded
as basic to the Gestalt theory of percep- 
tion, and his more recent experiments on 
thinking belong in the Gestalt tradition. 
His interests were too varied, however, 
to fit neatly within any formal system, 
and we find him in his Gestaltpsychologie
(1944) expressing impatience with
the narrowness of the Gestalt approach. 
Like Stern he believed that every part 
process must be understood ultimately 
in terms of the total person, but he 
lacked Stern’s compulsion to turn his 
personalism into a philosophy. With 
Jaensch he shared an interest in the pos- 
sibilities of typology, but for him typol- 
ogy was a problem for research rather 
than a revelation. He found merit in
the developmental approach, both ontogenetic
and phylogenetic, but he rejected
the extremes of both nativism and
empiricism. He was willing to accept
physiological evidence and to do physiological
experiments when he felt that
such would help to clarify a psychologi- 
cal problem, but he refused to accord to 
physiological constructs any unique ex- 
planatory value. 

It is perhaps best to think of Katz as 
essentially a pioneer, catholic rather 
than eclectic, ready to adapt to his pur- 
poses any tool, material or conceptual, 
that looks useful, but never forgetting 
the purpose for which he has selected 
it. For Katz there was a single purpose 
that persisted throughout his scientific 
life. It was, to put it in old-fashioned 
language, to understand the phenomena 
of the human mind. Those who see as 
a challenge to science all the phenomena 
of human mentality will find in Katz a 
kindred spirit. 

Robert B. MacLeod


Cornell University 





Psychological Review 
Vol. 61, No. 1, 1954 


THE PHYSIOLOGY OF MOTIVATION 


ELIOT STELLAR 


The Johns Hopkins University 


In the last twenty years motivation 
has become a central concept in psy- 
chology. Indeed, it is fair to say that 
today it is one of the basic ingredients 
of most modern theories of learning, per- 
sonality, and social behavior. There is 
one stumbling-block in this noteworthy 
development, however, for the particu- 
lar conception of motivation which most 
psychologists employ is based upon the 
outmoded model implied by Cannon in 
his classical statement of the local theories
of hunger and thirst (23). Cannon’s
theories were good in their day,
but the new facts available on the physiological
basis of motivation demand that
we abandon the older conceptualizations
and follow new theories, not only in the
study of motivation itself, but also in
the application of motivational concepts
to other areas of psychology. 

This argument for a new theory of 
motivation has been made before by 
Lashley (42) and Morgan (47). But 
it is more impelling than ever today 
because so much of the recent evidence 
is beginning to fit into the general the- 
oretical framework which these men 
suggested. Both Lashley and Morgan 
pointed out that the local factors pro- 
posed by Cannon (e.g., stomach con- 
tractions or dryness of the throat) are 
not necessary conditions for the arousal 
of motivated behavior. Instead, they 
offered the more inclusive view that a 
number of sensory, chemical, and neural 
factors cooperate in a complicated phys- 
iological mechanism that regulates moti- 
vation. The crux of their theory was 
described most recently by Morgan as 
a central motive state (c.m.s.) built up 
in the organism by the combined influences
of the sensory, humoral, and neural
factors. Presumably, the amount of
motivated behavior is determined by the 
level of the c.m.s. 

Beach (8, 11), in his extensive work 
on the specific case of sexual motivation, 
has amply supported the views of Lash- 
ley and Morgan. But the important 
question still remains: Do other kinds 
of motivated behavior fit the same gen- 
eral theory? As you will see shortly, a 
review of the literature makes it clear 
that they do. As a matter of fact, there 
is enough evidence today to confirm and 
extend the views of Lashley, Morgan, 
and Beach and to propose, in some de- 
tail, a more complete physiological the- 
ory of motivation. 

There are a number of ways to pre- 
sent a theoretical physiological mecha- 
nism like the one offered here. Perhaps 
the best approach is to start with an 
overview and summarize, in a schematic 
way, the major factors at work in the 
mechanism. Then we can fill in the de- 
tails by reviewing the literature relevant 
to the operation of each factor. Some 
advantage is lost by not taking up the 
literature according to behavioral topics, 
that is, different kinds of motivation. 
But the procedure adopted here lets us 
focus attention directly on the theory 
itself and permits us to make some very 
useful comparisons among the various 
kinds of motivation. Once the theoreti- 
cal mechanism and the evidence bearing 
on it are presented, the final step will 
be to evaluate the theory and show what 
experiments must be done to check it 
and extend it. 


THEORETICAL SCHEME 


A schematic diagram of the physiological
mechanism believed to be in control
of motivated behavior is shown in
Fig. 1. The basic assumption in this 
scheme is that the amount of motivated 
behavior is a direct function of the 
amount of activity in certain excitatory 
centers of the hypothalamus. The ac- 
tivity of these excitatory centers, in 
turn, is determined by a large number 
of factors which can be grouped in four 
general classes: (a) inhibitory hypo- 
thalamic centers which serve only to 
depress the activity of the excitatory 
centers, (b) sensory stimuli which con- 
trol hypothalamic activity through the 
afferent impulses they can set up, (c) 
the internal environment which can in- 
fluence the hypothalamus through its 
rich vascular supply and the cerebro- 
spinal fluid, and (d) cortical and tha- 
lamic centers which can exert excitatory 
and inhibitory influences on the hypo- 
thalamus. 
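
The scheme can be made concrete with a small numerical sketch. The theory as stated gives no functional form, so everything quantitative here is an assumption made only for illustration: the additive combination of influences, the subtraction of inhibitory activity, the floor at zero, the unitless scales, and the names `Inputs`, `excitatory_activity`, `motivated_behavior`, and `gain` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Inputs:
    """Hypothetical, unitless levels for the four classes of factors."""
    inhibitory_center: float     # (a) activity of inhibitory hypothalamic centers
    sensory: float               # (b) afferent impulses set up by sensory stimuli
    internal_environment: float  # (c) influences via blood supply and cerebrospinal fluid
    cortical_thalamic: float     # (d) net cortical and thalamic influence (may be negative)


def excitatory_activity(x: Inputs) -> float:
    """Assumed rule: excitatory influences add, inhibition subtracts, floor at zero."""
    drive = x.sensory + x.internal_environment + x.cortical_thalamic
    return max(0.0, drive - x.inhibitory_center)


def motivated_behavior(x: Inputs, gain: float = 1.0) -> float:
    """Amount of behavior as a direct (here proportional) function of excitatory activity."""
    return gain * excitatory_activity(x)


if __name__ == "__main__":
    baseline = Inputs(inhibitory_center=0.5, sensory=1.0,
                      internal_environment=1.0, cortical_thalamic=0.5)
    print(motivated_behavior(baseline))
```

Any monotonic rule would serve equally well here; the proportional form is chosen only because it is the simplest reading of "direct function."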

Fig. 1. Scheme of the physiological factors
contributing to the control of motivated behavior.
(See text.)
[Diagram not reproduced. Recoverable labels:
cortex and thalamus (serial organization of
pattern; arousal of pattern); hypothalamus;
internal factors, chemical and physical
(hormones, osmotic pressure); sensory stimuli;
final common path for behavior; regulation
of internal balance.]

As can be seen, the present theory
holds that the hypothalamus is the seat
of Morgan’s c.m.s. and is the “central
nervous mechanism” Lashley claimed
was responsible for “drive.” Identifying
the hypothalamus as the main integrating
mechanism in motivation makes
the experimental problem we face more
specific and more concrete than ever
before. But it also makes it more 
complicated, for the physiological con- 
trol of the hypothalamus is exceedingly 
complex. The influence of the internal 
environment on the hypothalamus is 
changing continuously according to nat- 
ural physiological cycles, and of course 
it may often be changed directly by 
the chemical and physical consequences 
of consummatory behavior (see Fig. 1). 
Sensory stimuli may also have varied 
effects on the hypothalamic mechanism, 
depending upon their particular pattern, 
previous stimulation, previous learning, 
sensory feedback from the consumma- 
tory behavior itself, and the influence 
the internal environment has already ex- 
erted on the hypothalamus. Similarly, 
the influence of the cortex and thalamus 
will add to the hypothalamic activity 
already produced by sensory stimuli and 
the internal environment. Presumably, 
these cortical and thalamic influences 
may result directly or indirectly from 
sensory stimulation, but they may also 
be controlled partly by the “upward 
drive” of the hypothalamus itself (43). 
Then, to complicate the picture even
more, there are the inhibitory centers 
of the hypothalamus which are also con- 
trolled by the various internal changes, 
sensory stimuli, and cortical and tha- 
lamic influences. These centers, pre- 
sumably, depress the activity of the ex- 
citatory centers and, therefore, attenu- 
ate their output. 

Fortunately, this mechanism is not 
as formidable against experimental at- 
tack as it might appear. The basic ex- 
perimental approach is to isolate the 
controlling factors in any type of mo- 
tivation and determine their relative 
contributions to hypothalamic activity. 
As you will see, a number of experimen- 
tal techniques like sensory deprivation, 
hormone and drug administration, corti- 
cal ablation, and the production of sub- 
cortical lesions may be used fruitfully 





THE PHYSIOLOGY OF MOTIVATION 


to isolate these factors. But that is only 
half the problem. Obviously, the fac- 
tors controlling hypothalamic activity 
and motivation do not operate in isola- 
tion. In fact, it is quite clear that their 
influences interact. Therefore, it be- 
comes an equally important problem to 
determine the relative contribution of 
each factor while the others are operat- 
ing over a wide range of variation. 


EXPERIMENTAL EVIDENCE 


Before going into the literature bear- 
ing on the operation of each of these 
factors in control of motivated behav- 
ior, it will help to raise a few questions 
that ought to be kept in mind while 
considering the experimental evidence. 
Are there different hypothalamic centers 
controlling each kind of motivation? 
Does the hypothalamus exert its influ- 
ence through direct control of the final 
effector pathways or does it simply have 
a “priming” effect on effector paths con- 
trolled by other parts of the nervous 
system? Do all these factors operate 
in the control of each type of motiva- 
tion or are there cases where sensory 
stimuli, for example, may not be impor- 
tant or where changes in the internal 
environment do not contribute? Can 
the same mechanism describe the con- 
trol of motivation measured by simple 
consummatory behavior, preference, and 
learning? Are the same mechanisms in- 
volved in the control of simple, biologi- 
cal motives and complex, learned mo- 
tives? 

Hypothalamic centers. Review of the 
literature on the role of the hypothala- 
mus in motivation brings out three gen- 
eral conclusions. (a) Damage to re- 
stricted regions of the hypothalamus 
leads to striking changes in certain kinds 
of motivated behavior. (b) Different
parts of the hypothalamus are critical 
in different kinds of motivation. (c) 
There are both excitatory and inhibitory
centers controlling motivation in the
hypothalamus; that is, damage to the 
hypothalamus can sometimes lead to an 
increase in motivation and sometimes a 
marked decrease. 

The evidence bearing on these three 
points can be summarized briefly. Many 
experiments have shown that restricted 
bilateral lesions of the hypothalamus 
will make tremendous changes in basic 
biological motivations like hunger (16, 
22), sleep (49, 50, 53), and sex (6, 18, 
20). Less complete evidence strongly 
suggests that the same kind of hypothalamic
integration is also true in the
cases of thirst (61), activity (35), and 
emotions (5, 62). We have only sug- 
gestive evidence in the case of specific 
hungers (59). 

It is clear that there is some kind of 
localization of function within the hypo- 
thalamus although it is not always pos- 
sible to specify precisely the anatomical 
nuclei subserving these functions. The 
centers for hunger are in the region of 
the ventromedial nucleus which lies in 
the middle third of the ventral hypo- 
thalamus, in the tuberal region (16). 
(See Fig. 2.) Sleep is controlled by 
centers in the extreme posterior (mam- 
millary bodies) and extreme anterior 
parts of the hypothalamus (49, 50). 
The critical region for sexual behavior 
is in the anterior hypothalamus, between 
the optic chiasm and the stalk of the 
pituitary gland (18, 20). The center 
for activity is not clearly established, 
but seems to be adjacent to or overlapping
the centers for hunger (35).
Finally, the centers for emotion are also 
in the vicinity of the ventromedial nu- 
cleus, perhaps somewhat posterior to 
the hunger centers and overlapping the 
posterior sleep center (50, 62). 

Fig. 2. Schematic drawing of the hypothalamus and its major neural connections. Adapted
from W. R. Ingram’s diagram in Gellhorn (30) and D. B. Lindsley’s Figure 9 (43).

Abbreviations and Description of Pathways

A.C.             Anterior commissure
Amyg.            Amygdala
Ant.             Anterior thalamic nuclei
Cingulate Gyrus  Cortex of cingulate gyrus
Dors. Teg. N.    Dorsal tegmental nucleus
Fr. Cortex       Cortex of frontal lobe
GP               Globus pallidus
Hab.             Habenular nucleus of thalamus
Hip. Gyrus       Hippocampal gyrus
IC               Inferior colliculus
Mam.             Mammillary nuclei
Med.             Dorsal medial thalamic nucleus
MFB              Medial forebrain bundle
N.V              Motor nucleus, Vth nerve
N.VII            Motor nucleus, VIIth nerve
Olf. Bulb        Olfactory bulb
Opt. X           Optic chiasm
P.C.             Posterior commissure
Pit.             Pituitary gland
Pv.              Paraventricular nucleus
Pyr. Cortex      Pyriform cortex
Ret.             Reticular formation
SC               Superior colliculus
Sep.             Septal nuclei
So.              Supraoptic nucleus
Tub.             Tuber cinereum

Afferents to Hypothalamus

1. Corticothalamic fibers
2. Frontothalamic fibers
3. Frontoseptal fibers
4. Olfacto-hypothalamic tract
5. Septo-hypothalamic fibers
6. Fornix
7. Mammillothalamic tract
8. Thalamo-hypothalamic fibers
9. Pallido-hypothalamic fibers
10. Sensory systems ascending to thalamus
    10 a. cranial afferents
    10 b. somatic and visceral afferents
11. Sensory collaterals to hypothalamus
12. Paraventriculo-supraoptic fibers

Efferents from Hypothalamus

13. Supraoptic hypophyseal tract
14. Mammillohabenular tract
15. Mammillotegmental tract
16. Dorsal longitudinal fasciculus
17. Descending efferents relaying in brain stem and medulla

In at least two cases it is clear that
there must be both excitatory and inhibitory
centers controlling motivated
behavior. In the case of hunger, bilateral
lesions in the ventromedial nucleus
near the midline produce a tremendous
amount of overeating (3, 16). Such a
center is presumably an inhibitory one
since removing it leads directly to an
increase in eating behavior. On the
other hand, lesions 1½ to 2 millimeters
off the midline at the level of the ventromedial
nucleus completely eliminate
hunger behavior (3, 4). After such lesions
animals never eat again, so we
can call such centers excitatory centers. 
Supporting this interpretation is the 
fact, recently reported, that stimulating 
these lateral centers in the waking cat 
through implanted electrodes results in 
vast overeating (27). The same sort of 
mechanism turns up in the case of sleep. 
In the posterior hypothalamus, in the 
region of the mammillary bodies, there 
are excitatory centers or “waking” cen- 
ters which operate to keep the organism 
awake (49, 50). When they are re- 
moved, the animal becomes somnolent 
and cannot stay awake. In the an- 
terior hypothalamus, around the pre- 
optic nucleus, there is an inhibitory cen- 
ter (49). When that is removed, the 
animal is constantly wakeful. 

So far, only an excitatory center has 
been found in the case of sexual behavior.
Bilateral lesions anterior to the
pituitary stalk eliminate all mating behavior
(18, 20), but no lesion of the
hypothalamus has ever been reported
that resulted in an exaggeration of sexual
motivation. What little we know
about the center for activity near the 
ventromedial nucleus suggests that it is 
also an excitatory center since lesions 
there produce only inactivity and not 
hyperactivity (35). In the case of 
emotions, the picture is not yet clear. 
Lesions near the ventromedial nucleus 
make cats highly emotional (62), and 
therefore this center must be inhibitory. 
But the lateral regions of the posterior 
hypothalamus seem to be excitatory, for 
lesions there make animals placid (50). 
Furthermore, direct stimulation of these 
posterior regions produces many of the 
signs of rage reactions (52). 

There is some evidence that sheds 
light on how the excitatory and inhib- 
itory hypothalamic centers may coop- 
erate in the regulation of motivation. 
In the clear-cut cases of sleep and hunger
it appears that the inhibitory centers
operate mainly through their effects on
the excitatory centers. At least we
know that when both centers are removed
simultaneously the effect is indistinguishable
from what happens when
only the excitatory centers are removed 
(3, 49). So it is convenient for present 
theoretical purposes to think of the in- 
hibitory center as one of the factors 
which influences the level of activity of 
the excitatory center. In fact, to specu- 
late one step further, it is worth suggest- 
ing that the inhibitory centers may con- 
stitute the primary neural mechanism 
regulating the satiation of motivation. 
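
The lesion logic in this argument can be checked with a toy calculation. The sketch below is not drawn from the cited experiments; it merely assumes that the inhibitory center acts only by subtracting from the excitatory center's activity, and that behavior is read out from the excitatory center alone. The function name `behavior` and all numerical values are hypothetical.

```python
def behavior(drive: float, inhibition: float,
             excitatory_intact: bool = True,
             inhibitory_intact: bool = True) -> float:
    """Toy output of an excitatory center that is braked by an inhibitory center."""
    if not excitatory_intact:
        # With the excitatory center removed there is nothing left to inhibit.
        return 0.0
    brake = inhibition if inhibitory_intact else 0.0
    return max(0.0, drive - brake)


if __name__ == "__main__":
    normal = behavior(2.0, 1.0)
    inhibitory_lesion = behavior(2.0, 1.0, inhibitory_intact=False)
    excitatory_lesion = behavior(2.0, 1.0, excitatory_intact=False)
    both_lesions = behavior(2.0, 1.0, excitatory_intact=False,
                            inhibitory_intact=False)
    print(normal, inhibitory_lesion, excitatory_lesion, both_lesions)
```

Under this assumption the combined lesion gives the same result as the excitatory lesion alone, which is the pattern reported for hunger and sleep; a model in which the inhibitory center acted directly on the effector pathways would not in general show it.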
Sensory stimuli. What effects do sen- 
sory stimuli have upon the hypothala- 
mus and how important are such stim- 
uli in the control of motivation? Some 
answer to the first part of this question 
is given by the schematic outline of hy- 
pothalamic connections shown in Fig. 2. 
Clearly the hypothalamus has a rich 
supply of afferents coming directly or 
indirectly from all the various sense or- 
gans. In fact the diagram is really an 
understatement of hypothalamic con- 
nections because it is an oversimplified 
and conservative representation. Physi- 
ological evidence shows, for example, 
that there must be connections from the 
taste receptors via the solitary nucleus 
of the medulla (36). Also there is evi- 
dence of rich connections from the vis- 
ual system via the lateral geniculate of 
the thalamus (36). There is no doubt 
about the fact that the hypothalamus is 
under very extensive sensory control. 
As to the sensory control of motiva- 
tion, there is excellent reason to believe 
that the stimuli which can set up im- 
pulses in these pathways to the hypo- 
thalamus are of particular importance. 
Perhaps the best example comes from 
the study of sexual behavior (11). The 
consensus of a group of studies on dif- 
ferent mammals is as follows. Sexual 
behavior is not dependent upon any sin- 
gle sensory system. Extirpation of any
one peripheral sense organ has no appre- 
ciable influence on the arousal and exe- 
cution of sexual behavior. If two sen- 
sory avenues are destroyed, however, 
sexual behavior may be eliminated, 
especially in the case of the naive ani- 
mal. With experienced animals, inter- 
estingly enough, it may take destruction 
of three sensory systems. But in nei- 
ther case does it matter what combina- 
tion of sensory systems is eliminated. 
We can conclude, therefore, that it is 
the sum total of relevant sensory im- 
pulses arriving at the central nervous 
system (hypothalamus) that is impor- 
tant in setting off sexual behavior. 
Kleitman’s analysis of sleep and 
wakefulness shows that the same kind 
of sensory control operates in this case 
(38). Wakefulness seems to be de- 
pendent upon the sum total of sensory 
impulses arriving at the waking center 
in the posterior hypothalamus, regard- 
less of the particular sensory systems 
involved. Direct support of this kind
of view is offered by Bremer’s (14) 
physiological data which showed that 
maintenance of the waking rhythm of 
the brain is less a matter of any par- 
ticular sensory input and more a mat- 
ter of the amount of sensory input. 


What we know about hunger and 
thirst suggests that the amount of mo- 
tivated behavior in these cases should 
be a joint function of sensory impulses 
arising from gastric contractions or dry- 
ness of the throat and taste, tactile, and 
temperature receptors in the mouth. 
Unfortunately we have no sensory depri- 
vation experiments that are a good test 
of this point. But all the evidence on 
the acceptability of foods and fluids 
of different temperatures, consistencies, 
and flavoring suggests the joint opera- 
tion of many stimuli in the control of 
these types of motivation. 

So far, we have mentioned only stim- 
uli which arouse motivation. What 
stimulus changes could reduce motiva-
tion and perhaps lead to satiation? 
There are three general possibilities: 
(a) a reduction in excitatory stimuli, 
(b) interfering or distracting stimuli
that elicit competing behavior, and (c) 
“inhibitory” stimuli. It is easy to find 
examples of the first two types of stim- 
ulus changes and to guess their mech- 
anisms of operation in terms of the 
present theory. In the case of “inhib-
itory” stimuli, however, all we have is 
suggestive evidence. For example, the 
fact that dogs with esophageal fistulas 
eat (37) and drink (1, 13) amounts 
proportional to the severity of depri- 
vation suggests that the stimuli which 
feed back from consummatory behavior 
might have a net inhibitory effect on 
motivation (see Fig. 1). Furthermore, 
some of the experiments on artificially 
loading the stomach suggest that a full 
gut may result in stimuli which inhibit 
further eating (37) or drinking (2, 
13) over and above the possibility that 
there might be no room left in the stom- 
ach or that gastric contractions are re- 
duced. 

In summary, we can state the follow- 
ing working hypotheses about the sen- 
sory factors which operate in the control 
of motivation. (a) No one sensory ave- 
nue is indispensable in the arousal of 
motivated behavior. Instead, sensory 
stimuli have an additive effect on the 
excitability of the hypothalamus so 
that it is the sum total of relevant 
impulses arriving at the excitatory cen-
ters of the hypothalamus that determines
the amount of motivated behavior. (b)
Judging from the resistance of experi- 
enced animals to the effects of sensory 
deprivation in the case of sexual moti- 
vation, it seems clear that excitatory in- 
fluences in the hypothalamus may be 
exerted by learned as well as unlearned 
stimuli. (c) There are afferent impulses 
to the hypothalamus which have a net 
inhibitory effect on the excitatory cen- 
ters and thus serve to reduce motivation
or produce satiation. The best guess at 
present is that these “inhibitory” stim- 
uli operate by exerting an excitatory in- 
fluence on the inhibitory centers of the 
hypothalamus. Presumably, impulses to 
inhibitory centers have the same kind 
of additive properties as impulses to the 
excitatory centers. 
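
The summation hypothesis in (a) and (b) can be made concrete with a small
sketch. What follows is a toy illustration only, not a model fitted to any
of the studies cited: the list of sensory avenues, the equal unit weights,
the size of the learned contribution, and the threshold are all assumed
values, chosen so that the sketch reproduces the qualitative pattern
described above for sexual behavior.

```python
from itertools import combinations

# Every name and number below is a hypothetical illustration,
# not a value reported in the studies cited in the text.
SENSORY_AVENUES = ("vision", "olfaction", "touch", "audition")
UNIT_INPUT = 1.0      # assumed equal contribution of each intact avenue
LEARNED_BONUS = 1.0   # assumed extra excitation from learned stimuli
THRESHOLD = 2.5       # assumed excitation needed to arouse the behavior


def excitation(destroyed, experienced):
    """Sum the excitatory input reaching the hypothetical center."""
    intact = [s for s in SENSORY_AVENUES if s not in destroyed]
    total = UNIT_INPUT * len(intact)
    if experienced:
        total += LEARNED_BONUS
    return total


def is_aroused(destroyed=(), experienced=False):
    """Behavior occurs only if summed excitation reaches the threshold."""
    return excitation(destroyed, experienced) >= THRESHOLD


def outcomes_for(n_destroyed, experienced):
    """Collect the outcomes over every combination of destroyed avenues."""
    return {
        is_aroused(combo, experienced)
        for combo in combinations(SENSORY_AVENUES, n_destroyed)
    }


if __name__ == "__main__":
    for experienced in (False, True):
        label = "experienced" if experienced else "naive"
        for n in range(len(SENSORY_AVENUES)):
            print(label, n, "avenues destroyed:", outcomes_for(n, experienced))
```

Because every avenue is given the same weight, each set returned by
`outcomes_for` contains a single value, which corresponds to the observation
that the particular combination of sensory systems eliminated does not
matter. With the assumed numbers, the naive animal fails after two avenues
are destroyed and the experienced animal only after three.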

Internal environment. That the in- 
ternal environment plays an important 
role in certain kinds of motivated be- 
havior is a well-established fact. Two 
basic questions must be asked, how- 
ever, before we can understand much 
about how the internal environment 
does its work. What kinds of changes 
that can occur in the internal environ- 
ment are the important ones in motiva- 
tion? How do changes in the internal 
environment influence the nervous sys- 
tem and, therefore, motivated behavior? 

In terms of the present theory, we 
would expect the internal environment 
to operate in motivation by changing
the excitability of hypothalamic centers. 
This is a reasonable expectation, for the 
hypothalamus is the most richly vascu- 
larized region of the central nervous
system (24). Not only that, but the 
hypothalamus is also in direct contact 
with the cerebrospinal fluid in the third 
ventricle. 

The case of sexual behavior again 
makes an excellent example. Experi- 
ments on the spayed, female cat (6, 17) 
and spayed, female guinea pig (28) 
have shown that hypothalamic regions 
must be intact and functioning if in- 
jected sex hormones are to arouse es- 
trous behavior. If a section is made 
through the spinal cord only rudimen-
tary fragments of sexual behavior can
be elicited by appropriate stimulation, 
and injected sex hormones make no con- 
tribution to the response. Essentially 
the same thing is true if the section is 
made high in the hind brain but ex- 
cludes the hypothalamus. When the 
decerebration is just above the hypo-
thalamus, full estrous reactions can be 
aroused by appropriate stimulation, but 
only if sex hormones have been admin- 
istered. It is clear, then, that not only 
is the hypothalamus the main integrat- 
ing center for sexual reactions, but it 
is also most likely the main site of ac- 
tion of the sex hormones. This point 
is further supported by studies of female 
guinea pigs with pinpoint lesions of the 
anterior hypothalamus. These animals 
fail to show sexual behavior even under 
the influence of massive doses of sex 
hormones (19). 

A very similar mechanism seems to 
be involved in the case of motivated be- 
havior dependent upon the organism’s 
defenses against temperature extremes 
(activity, nesting, hoarding, selection of 
high-calorie diets). We know, for ex- 
ample, that reactions regulating body 
temperature in the face of heat and cold 
are integrated in two separate centers 
in the hypothalamus (15, 51). Lesions 
in the anterior hypothalamus destroy 
the ability to lose heat and, therefore,
to survive in high temperatures. Pos- 
terior hypothalamic lesions, conversely, 
result in a loss of heat production mech- 
anisms so that the animal succumbs to 
cold. Furthermore, artificially raising 
the temperature of the anterior hypo-
thalamus will quickly induce heat loss,
suggesting that normally the tempera- 
ture of the blood may be important 
in activating the hypothalamic mecha- 
nisms (15, 44). Unfortunately our in- 
formation stops here. There are no 
direct physiological studies on the role 
of these temperature-regulating mecha-
nisms in the control of motivated be-
havior like activity, hoarding, nesting, 
or food selection. But it seems clear 
that the temperature of the blood may 
be one of the kinds of changes in the 
internal environment that can affect the 
hypothalamus, and it may be important 
in motivated behavior. 





Ample evidence demonstrates that 
there are important changes in the in- 
ternal environment involved in other 
kinds of motivated behavior. In hun- 
ger it has been shown that chemicals 
like insulin (32, 33, 48) and d-ampheta- 
mine (57) influence the rate of eating. 
It is clear that these chemicals do not 
operate primarily through their effects 
on gastric contractions, but it is only 
by a process of elimination that we can 
guess that their sites of action are in 
the hypothalamus. Supporting this pos- 
sibility is the evidence that there are 
chemoreceptors in the hypothalamus 
which are sensitive to variations in 
blood sugar and important in the regu- 
lation of hunger (45). In the case of 
specific hungers, much evidence shows 
that food preference and diet selection 
depend upon changes in the internal en- 
vironment produced by such things as 
pregnancy, dietary deficiencies, or dis- 
turbances of endocrine glands (54). 
Furthermore there are some preliminary
experimental data, in the case of salt
and sugar appetites, to suggest that 
there are separate regulatory centers in 
the hypothalamus which are responsive 
to changes in salt and sugar balance 
(59). Finally, in the case of thirst we 
know that a change in osmotic pressure, 
resulting from cellular dehydration, is 
the important internal change leading 
to drinking behavior (31). We know 
further that in the hypothalamus there 
are nerve cells, called “osmoreceptors,” 
which are extremely sensitive to minute 
changes in osmotic pressure (61). But 
the direct experiment has not been done 
to check whether or not it is these nerve 
cells which are mainly responsible for 
the control of thirst.¹

¹ In a recent publication, Anderson of Stock-
holm has shown that injection of small quan-
tities of hypertonic NaCl directly into re-
stricted regions along the midline of the hy-
pothalamus produces immediate and extensive
drinking in water-satiated goats. (Anderson,
B. The effect of injections of hypertonic
NaCl-solutions into different parts of the hy-
pothalamus of goats. Acta Physiol. Scand.,
1953, 28, 188-201.)

Obviously the experimental evidence
on hunger, specific hunger, and thirst
is incomplete. But enough of it fits
into the scheme of the theoretical mech-
anism proposed here to suggest the real
possibility that the internal changes im-
portant in these cases operate largely
through their effects on the hypothala-
mus.

One question still remains. What
role does the internal environment play
in the mechanism of satiation? About
all we have to go on at present is the
very striking fact from the case of
specific hungers that vastly different
amounts of consummatory behavior are
needed to bring about satiation for dif-
ferent food substances. In vitamin de-
ficiencies only a few milligrams of sub-
stance need be consumed to produce
satiation, whereas in caloric deficiencies
many grams of carbohydrate, fat, or
protein must be ingested. Presumably,
it is not the sensory feedback from con-
summatory behavior that is important
in these cases, but rather some inhibi-
tory effects produced by what is con-
sumed (Fig. 1). Within the present
theoretical framework, such inhibitory
effects could be produced either by de-
pression of excitatory centers of the
hypothalamus or by arousal of activity
in inhibitory centers. The problem is
an important one and it is wide open
for study.

It is clear from the foregoing that
many types of motivated behavior are
dependent upon changes in the internal
environment. Several points are worth
emphasizing. (a) A variety of kinds
of changes in the internal environment
can play a role in the regulation of mo-
tivation: variation in the concentration
of certain chemicals, especially hor-
mones, changes in osmotic pressure, and
changes in blood temperature. (b) The
best hypothesis at present is that these 
internal changes operate by contribut- 
ing to the activity of excitatory hypo- 
thalamic centers controlling motivation. 
(c) An equally important but less well- 
supported hypothesis is that internal 
changes, normally produced by con- 
summatory behavior, operate in the pro- 
duction of satiation by depressing ex- 
citatory centers or arousing inhibitory 
centers of the hypothalamus. 

Cortical and thalamic centers. De- 
spite the heavy emphasis laid upon the 
hypothalamus in this discussion, it is 
obvious that it is not the only neural 
center operating in the control of moti- 
vated behavior. In the first place, some 
of the sensory, motor, and associative 
functions of the cortex and thalamus 
are directly important in motivation 
quite apart from any influence they have 
on the hypothalamus. Secondly, even 
though the hypothalamus may be the 
main integrating center in motivation, 
it does not operate in isolation. There
is much evidence that the hypothalamus
is under the direct control of a number
of different cortical and thalamic centers
(Fig. 2).

The case of emotions offers the best 
example of how the cortex may operate 
in motivation. According to the early 
work of Bard and his co-workers on the 
production of “sham rage” by decorti- 
cation, it looked as though the entire 
cortex might normally play an inhibi- 
tory role in emotions (5). More recent 
work, however, shows that cortical con- 
trol of emotion is more complicated than 
this. Bard and Mountcastle (7), for 
example, have found that removal of 
certain parts of the old cortex (particu- 
larly amygdala and transitional cortex 
of the midline) produced a tremendous 
increase in rage reactions in cats. On 
the other hand, removing only new cor- 
tex resulted in extremely placid cats. 
Results of work with monkeys (40) and
some very recent experiments with cats 
disagree somewhat with these findings in 
showing that similar old cortex removals 
lead to placidity rather than ferocity. 
The disagreement is yet to be resolved, 
but at least it is clear that different 
parts of the cortex may play different 
roles in the control of emotion, certain 
parts being inhibitory and others excita- 
tory. 

In the case of sleep, it appears so far 
that the cortex and thalamus play ex- 
citatory roles, perhaps having the effect 
of maintaining the activity of the wak- 
ing center in the posterior hypothala- 
mus. Decortication in dogs, for exam- 
ple, results in an inability to postpone 
sleep and remain awake for very long, 
or, as Kleitman puts it, a return to poly-
phasic sleep and waking rhythms (38,
39). Studies of humans, moreover, 
show that even restricted lesions of the 
cortex or thalamus alone can result in 
an inability to stay awake normally 
(25, 26). But no inhibitory effects of 
the cortex in sleep have yet been un- 
covered. 

In sexual behavior it has been found 
that lesions of the new cortex may inter- 
fere directly with the arousal of sexual 
behavior (9, 11). Large lesions are 
much more effective than small lesions, 
as you might expect. Furthermore, cor- 
tical damage is much more serious in 
male animals than in females and is 
much more important in the sexual be- 
havior of primates than it is in the case 
of lower mammals. On the other hand, 
in connection with studies of the cortex 
in emotions, it has been found that le- 
sions of the amygdala and transitional 
cortex of the midline can lead to height- 
ened sexuality in cats and monkeys (7, 
40). So it looks as though the cortex 
may exert both excitatory and inhibi- 
tory influences in sexual motivation. 

Evidence from other types of moti- 
vated behavior is only fragmentary, but 
it fits into the same general picture. In
the case of hunger, it has been reported 
that certain lesions of the frontal lobes
will lead to exaggerated eating behavior 
(41, 55). Hyperactivity may follow 
similar frontal lobe lesions and is par- 
ticularly marked after damage to the 
orbital surface of the frontal lobe (56). 
The frontal areas may also be involved 
in what might be called pain avoidance. 
Clinical studies of man show that lobot- 
omies may be used for the relief of in- 
tractable pain (29). The curious thing 
about these cases is that they still report 
the same amount of pain after operation 
but they say that it no longer bothers 
them. Presumably the frontal cortex 
normally plays an excitatory role in the 
motivation to avoid pain.

In all the cases cited so far, the 
anatomical and physiological evidence 
available suggests strongly that the 
main influence of the cortex and thala- 
mus in motivation is mediated by the 
hypothalamus. But we do not yet have
direct proof of this point and need ex-
periments to check it.

Interaction of factors. Up to now, 
we have treated the various factors that 
can operate in the control of motivated 
behavior singly. However, one of the 
main points of the theory proposed here 
is that the various factors operate to- 
gether in the control of motivation. 
Presumably this interaction of factors 
occurs in the hypothalamus and takes 
the form of the “addition” of all ex- 
citatory influences and the “subtrac- 
tion” of all inhibitory influences. Some 
experimental evidence bears directly on 
this point. 
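
The "addition" and "subtraction" just described can be written schematically.
The notation below is an assumed shorthand introduced only for illustration:
the symbols A, E_i, I_j, M, and f do not appear in the original discussion,
and nothing is implied about units or about the exact form of f.

```latex
\[
A \;=\; \sum_{i} E_i \;-\; \sum_{j} I_j ,
\qquad
M \;=\; f(A), \quad f \text{ nondecreasing in } A .
\]
```

Here A stands for the net activity of an excitatory hypothalamic center,
the E_i for the separate excitatory influences upon it (sensory, learned,
internal, and cortical), the I_j for the inhibitory influences, and M for
the amount of motivated behavior.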

In the case of sexual behavior, for 
example, it is clear that excitatory in- 
fluences of the cortex and hormones are 
additive. After sexual motivation is 
eliminated by cortical damage it may 
be restored by the administration of 
large doses of sex hormones (10). Since 
the hypothalamus is the site of action 
of the sex hormones, it seems likely that
it is also the site of interaction of the 
influences of the hormones and cortex. 

In a similar way, it looks as though 
the contributions of sensory stimulation 
and sex hormones add in the hypothala- 
mus. Neither hormones nor stimulation 
alone is sufficient to elicit sexual reac- 
tions in most mammals, but the right 
combination of the two will. Still an- 
other example of the addition of ex- 
citatory influences is seen in the study 
of the sexual behavior of the male rab- 
bit. In this case neither destruction of 
the olfactory bulbs nor decortication 
will eliminate mating behavior, but a 
combination of the two operations will 
(21). 

It is very important to know whether 
excitatory, and perhaps also inhibitory, 
influences in other kinds of motivation 
have the same sort of additive properties 
as in sexual behavior. Indirect evidence 
suggests they do, but direct experiments 
of the sort described here are needed to 
check the possibility. 

Most encouraging in this connection 
is that students of instinctive behavior 
in inframammalian vertebrates and in- 
vertebrates have presented considerable 
evidence showing that sensory, chemical, 
and neural influences contribute jointly 
to the arousal of many kinds of moti- 
vated behavior (60). For example, in 
a number of cases it has been shown 
that the threshold for arousing behav- 
ior by various stimuli is lowered con- 
siderably by appropriate changes in 
the internal environment. In fact, in 
the extreme case, when internal changes 
are maximal, the behavior may occur 
in the absence of any obvious stimu-
lation. Presumably in these cases, as
in the examples of mammalian moti- 
vation, chemical and neural influences 
contribute to the arousal of some cen- 
tral response mechanism in an additive 
way. 

The role of learning. It is obvious to 
every student of mammalian motivation
that learning and experience may play 
extremely important roles in the regu- 
lation of motivated behavior. What 
does this mean in terms of the pres- 
ent physiological theory? Unfortu- 
nately, we cannot specify the mecha- 
nisms through which learning enters 
into the control of motivation because 
we are ignorant of the basic physiology 
of learning. But we can make some 
helpful inferences. 

The basic hypothesis in the present 
theoretical framework is that learning 
contributes to hypothalamic activity 
along with influences from unlearned 
afferent impulses, internal changes, and 
cortical activity. In the case of sexual 
behavior we know that many animals 
learn to be aroused sexually by stimuli 
which were not previously adequate. 
Further, we know that in such experi- 
enced animals it is difficult to reduce 
sexual motivation by eliminating ave- 
nues of sensory stimulation, presumably 
because the extra excitatory effects pro- 
duced by learned stimuli contribute to 
hypothalamic activity along with the 
impulses from unlearned stimuli. Along 
the same lines, it is known that sex 
hormones are relatively unimportant 
in man and in certain of the sub- 
human primates that have learned to 
be aroused by a wide variety of stim- 
uli (12). Again, this may mean that 
the excitatory effects from the learned 
stimuli have added enough to the ef- 
fects of unlearned stimuli to make it 
possible to dispense with the contribu- 
tion of the sex hormones in arousing 
hypothalamic activity. 

The evidence available on learning in 
other types of motivation fits in with 
this general theoretical picture, but di- 
rect physiological experiments have not 
yet carried us beyond the stage of in- 
ference. We know, for example, that 
vitamin-deficient rats can learn to show 
motivated behavior in response to cer-
tain flavors that have been associated 
with the vitamin in the past (34, 58). 
In fact, for a short while they will even 
pass up food containing the vitamin to 
eat vitamin-deficient food containing the 
flavor. Again, it looks as though flavor 
has become empowered by a process of 
learning to contribute to the excitabil- 
ity of the neural centers controlling 
motivation. 


LIMITATIONS OF THE THEORY 


Like any theoretical approach, the 
physiological mechanism proposed here 
has many limitations. Fortunately none 
of them need be too serious as long as 
it is recognized that the theory is set 
up as a general guide for experiments 
and a framework for further theorizing. 
Obviously the theory is going to have 
to be changed and improved many times 
before it is free of limitations. In this 
spirit it might be said that the limita- 
tions of the theory are not much more
than those aspects of motivation which
need research the most. But whether 
we label them limitations or urgent 
areas of research, they deserve explicit 
attention. 

The concept of “center.” Through- 
out this discussion the terms “neural 
center” and “hypothalamic center” have
been used. “Center” is a useful and 
convenient term, but it is also a dan- 
gerous one, for it may carry with it the 
implication of strict localization of func- 
tion within isolated anatomical entities. 
Actually this implication is not in-
tended, for it is recognized that localiza-
tion is a relative matter and that no 
neural mechanism operates in isolation. 
Furthermore, it is also possible that 
there may be no discoverable localiza- 
tion of the neural mechanisms governing 
some types of motivated behavior. The 
theory simply states at the moment that 
the best general hypothesis is that some 
degree of localization of the mechanisms
controlling motivation can be found in 
the hypothalamus. 

Execution of motivated behavior. No 
attempt has been made in this discus- 
sion to describe the details of the ef- 
ferent pathways or effector mechanisms 
responsible for the execution of moti- 
vated behavior. Discussion of the path- 
ways has been omitted because we know 
very little about them. About all we 
can do at present is to guess, from ana- 
tomical and physiological studies of hy- 
pothalamic function, that the hypothal- 
amus exerts some kind of “priming” 
effect on effector pathways controlled 
by other parts of the nervous system. 
Perhaps after the relationship of the 
hypothalamus to motivated behavior 
has been more firmly established we can 
profitably turn to the question of how
the hypothalamus does its work. 

A second aspect of the execution of 
motivated behavior has been omitted 
for the sake of brevity. We all recog- 
nize that an animal with certain kinds
of cortical lesions, or deprived of cer- 
tain sensory capacities, may be handi- 
capped in executing motivated behavior 
quite aside from any effects these op- 
erations may have on the arousal of
motivation. Fortunately most investi- 
gators have been aware of this problem 
and have taken pains to distinguish 
these two effects, focusing their atten- 
tion mainly on the arousal of motiva- 
tion. Some day, however, this theory 
should address the question of what 
neural mechanisms govern the execution 
of motivated behavior. 

General nature of the mechanism. 
For theoretical purposes it has been 
assumed that essentially the same mech- 
anism controls all types of motivated 
behavior. Obviously this is not likely 
to be the case, nor is it an essential 
assumption. In some types of motiva- 
tion only parts of this mechanism may 
be involved, or factors not included in 
the present scheme may operate. For
example, in some cases the hypothala- 
mus may not be involved at all, or it 
may turn out that there are no inhibi- 
tory centers at work, or that internal 
chemical factors do not contribute sig- 
nificantly. There is no reason why 
we should not be prepared for these 
eventualities. But until specific ex- 
perimental evidence to the contrary 
is forthcoming, the general mechanism 
proposed here still remains as the best 
working hypothesis for any particular 
type of biological motivation. 

Inadequacy of behavioral measures. 
To a large degree the present discussion 
is based upon measures of consumma- 
tory behavior. We all know that the 
various measures of motivation are not 
always in good agreement, so there is 
a good possibility that what we say about
consummatory behavior may not apply 
to motivation measured by other meth- 
ods. In fact, Miller, Bailey, and Ste- 
venson (46) have recently shown that 
whereas rats with hypothalamic lesions 
overeat in the free-feeding situation, 
they do not show a high degree of moti- 
vation when required to overcome some 
barrier to obtain food. 

Confining the present discussion 
mainly to consummatory behavior is 
clearly a weakness. But the logic be- 
hind this limited approach is to work 
out the physiological mechanisms in the 
simplest case first, and then to see how 
they must be revised to fit the more 
complicated cases. 

Complex motivation. It can also be 
argued, of course, that the present the- 
ory is confined to the simple, biological 
motives. Again, it seems eminently ad- 
visable to keep the theory relatively nar- 
row in scope until it is developed well 
enough to permit attack on the more 
complicated, learned motives. 

Comparative approach. No attempt 
has been made here to make it explicit 
how the proposed theory applies to or- 
ganisms representative of different phy-
logenetic levels. There are many ob- 
vious advantages to the comparative 
approach, but unfortunately, except for 
the case of sexual motivation, the in- 
formation we have on different species 
is too scattered to be useful. Judging 
from what we have learned from the 
comparative study of sexual motivation, 
however, we can expect the various fac- 
tors governing other types of motivation 
to contribute somewhat differently in 
animals at different phylogenetic levels. 
Certainly learning should be more im- 
portant in primates than in subprimates, 
and the contributions of the cortex and 
thalamus should be greater. Much will 
be gained if future research in motiva- 
tion follows the excellent example set 
in the study of sexual behavior and pro- 
vides the much needed comparative 
data. 


ADVANTAGES OF THE THEORY 


On the assumption that none of these 
limitations of the theory are critical,
it is appropriate to ask: What is 
gained by proposing an explicit the- 
ory of the physiological mechanisms 
underlying motivated behavior? There 
are many positive answers to this ques- 
tion, and we can list some of them 
briefly. 

Simplification of the problem. One 
of the main advantages of the theoreti- 
cal mechanism proposed here is that it 
brings together, into one general frame- 
work, a number of different kinds of 
motivation that have been studied sepa- 
rately in the past. Certainly the the- 
ory encompasses the basic facts avail- 
able on sex, hunger, specific hunger, 
thirst, sleep, and emotion. And it may 
also be able to handle the facts of pain 
avoidance, hoarding, nesting, maternal 
behavior, and other types of so-called 
instinctive behavior. As you have seen, 
one of the benefits deriving from this 
kind of simplification of the problem of 
motivation is the possibility of speeding
up progress by applying what has been 
learned about physiological mechanisms 
from the study of one kind of motiva- 
tion to the study of other kinds of moti- 
vation. Not only that, but the assump- 
tion that the hypothalamus is central 
in the control of all types of motivation 
may make it easier to explain the vari- 
ous types of interaction among motiva- 
tions that have shown up in many 
studies of behavior. 

Multifactor approach. Another ad- 
vantage of the present theory is that it 
gives strong emphasis to the view that 
motivation is under multifactor control. 
Single-factor theories, so prevalent since 
the days of Cannon, can only lead to 
useless controversies over which factor 
is the “right” one and must always 
be guilty of omission in trying to ac- 
count for the control of motivation. 
Of course, it must be stressed that the 
aim of the multifactor approach is not 
simply to list the many possible factors 
operating in motivation, but rather to 
get down to the concrete experimental 
task of determining the relevant factors 
which control motivation and the rela- 
tive contribution of each. 

Satiation of motivation. Unlike most 
previous theories of motivation, the 
mechanism proposed here attempts to 
account for the satiation of motivation 
as well as its arousal. In terms of the 
present theory satiation is determined 
by the reduction of activity in the main 
excitatory centers of the hypothalamus. 
More specifically, it looks as though the 
inhibitory centers of the hypothalamus 
may constitute a separate “satiation 
mechanism” which is the most impor- 
tant influence in the reduction of the 
activity of the excitatory centers. The 
possibility is an intriguing one, and it 
can be directly explored by experiment. 

Peripheral and central control. In 
the past the study of motivation has 
been hampered by the controversy over 
whether behavior is centrally or periph-
erally controlled. The controversy is 
nonsense. The only meaningful experi- 
mental problem is to determine how the 
central and peripheral, or sensory, fac- 
tors operate together in the control of 
behavior. It is this problem which the 
present theory addresses directly, and 
this is one of its greatest strengths. 

Learned and innate control. The 
present theory avoids another knotty 
controversy by directly addressing ex- 
perimental problems. Much time has 
been lost in psychology, and particularly 
in the study of motivation, in arguments 
over whether behavior is primarily in- 
nate or instinctive or whether it is pri- 
marily learned or acquired. The an- 
swer is obviously that it is both, and 
again the only meaningful experimental 
problem is to determine the relative con- 
tribution of each type of control. As 
far as the mechanism proposed here is 
concerned, both innate and learned fac- 
tors make their contributions to the con- 
trol of the same hypothalamic centers. 
There is still much work needed to de- 
termine the details of the mechanisms 
of operation, particularly of the learned 
factors, but some headway has been 
made and the problem is clearly set. 

Explicit nature of the theory. Fi-
nally, a number of advantages derive
simply from having an explicit state- 
ment of an up-to-date, physiological the- 
ory of motivation. In the first place, 
an explicit theory can serve as a con- 
venient framework within which to or- 
ganize the physiological facts we already 
have at our disposal. Second, the sys- 
tematic organization of the facts sharply 
points up many of the gaps in our 
knowledge and suggests direct experi- 
ments that should be done in the inves- 
tigation of motivated behavior. Third, 
an up-to-date, systematic theory pro- 
vides a useful and reasonably clear con- 
ceptualization of motivation for psy- 
chologists working in other areas of 
research. 


SUMMARY AND CONCLUSIONS 


A physiological theory of motivated 
behavior is presented. The basic as-
sumption in this theory is that the
amount of motivated behavior is a func- 
tion of the amount of activity in certain 
excitatory centers of the hypothalamus. 
The level of activity of the critical hy- 
pothalamic centers, in turn, is governed 
by the operation of four factors. 


1. Inhibitory centers in the hypothal- 
amus directly depress the activity of the 
excitatory centers and may be responsi- 
ble for the production of satiation. 

2. Sensory stimuli set up afferent 
impulses which naturally contribute to 
the excitability of the hypothalamus or 
come to do so through a process of 
learning. 

3. Changes in the internal environ- 
ment exert both excitatory and inhibi- 
tory effects on the hypothalamus. 

4. Cortical and thalamic influences 
increase and decrease the excitability 
of hypothalamic centers. 

Detailed experimental evidence is 
brought forward to show how these 
various factors operate in the manage- 
ment of different kinds of motivated 
behavior. The over-all scheme is shown 
diagrammatically in Fig. 1. 

Out of consideration of this evidence 
a number of hypotheses are generated 
to fill in the gaps in experimental knowl- 
edge. All these hypotheses are experi- 
mentally testable. The ones of major 
importance can be given here as a sum- 
mary of what the theory states and a 
partial list of the experiments it sug- 
gests. 


1. There are different centers in the 
hypothalamus responsible for the con- 
trol of different kinds of basic motiva- 
tion. 

2. In each case of motivation, there 
is one main excitatory center and one 
inhibitory center which operates to de-
press the activity of the excitatory cen-
ter.

There is already much experimental 
evidence supporting these two general 
hypotheses, but it is not certain that 
they apply fully to all types of basic 
biological motivation. The hypotheses 
should be checked further by deter- 
mining whether changes in all types 
of motivation can be produced by lo- 
cal hypothalamic lesions and whether 
both increases and decreases in moti- 
vation can always be produced. 

3. The activity of hypothalamic cen- 
ters is, in part, controlled by the excita- 
tory effects of afferent impulses gen- 
erated by internal and external stimuli. 

4. Different stimuli contribute differ- 
ent relative amounts to hypothalamic 
activity but no one avenue of sensory 
stimulation is indispensable. 

5. It is the sum total of afferent im- 
pulses arriving at the hypothalamus that 
determines the level of excitability and, 
therefore, the amount of motivation. 


The neuroanatomical and neurophysi- 
ological evidence shows that the hypo- 
thalamus is richly supplied with affer- 
ents coming directly and indirectly from
all the sense organs (Fig. 2). The be-
havioral evidence, furthermore, strongly 
suggests that motivation is never con- 
trolled, in mammals at least, by one 
sensory system, but rather is the com- 
bination of contributions of several sen- 
sory systems. Sensory control and sen- 
sory deprivation experiments are needed 
to check this point in the case of most 
kinds of biological motivation, particu- 
larly hunger, thirst, and specific hun- 
gers. 

6. A variety of kinds of physical and 
chemical changes in the internal en- 
vironment influences the excitability of 
hypothalamic centers and, therefore, 
contributes to the control of motivation. 

The evidence shows that the hypo- 
thalamus is the most richly vascularized 
region of the central nervous system and
is most directly under the influence of
the cerebrospinal fluid. Furthermore, it 
is clear that changes in the internal en- 
vironment produced by temperature of 
the blood, osmotic pressure, hormones, 
and a variety of other chemicals are 
important in motivation and most likely 
operate through their influence on the 
hypothalamus. Direct studies are still 
needed in many cases, however, to show 
that the particular change that is im- 
portant in motivation actually does 
operate through the hypothalamus and 
vice versa. 

7. The cerebral cortex and thalamus 
are directly important in the temporal 
and spatial organization of motivated 
behavior. 

8. Different parts of the cortex and 
thalamus also operate selectively in the 
control of motivation by exerting ex- 
citatory or inhibitory influences on the 
hypothalamus. 

Tests of these hypotheses can be car- 
ried out by total decortication, partial 
cortical ablations, and local thalamic 
lesions. It should be especially in- 
structive to see what effects cortical and 
thalamic lesions have after significant 
changes in motivation have been pro- 
duced by hypothalamic lesions. 

9. Learning contributes along with 
other factors to the control of motiva- 
tion, probably through direct influence 
on the hypothalamus. 

10. The relative contribution of 
learning should increase in animals 
higher and higher on the phylogenetic 
scale. 

A whole series of experiments is 
needed here. Particularly, there should 
be comparisons of naive and experienced 
animals to determine the relative effects 
of sensory deprivation, cortical and tha- 
lamic damage, and hypothalamic le- 
sions. Presumably animals that have 
learned to be aroused to motivated be- 
havior by previously inadequate stimuli 
should require more sensory deprivation
but less cortical and thalamic damage
than naive animals before motivation is 
significantly impaired. 

11. The various factors controlling 
motivation combine their influences at 
the hypothalamus by the addition of 
all excitatory influences and the sub- 
traction of all inhibitory influences. 

Some experiments have already been 
done in the study of sexual motivation 
to show that motivation reduced by the 
elimination of one factor (cortical le- 
sions) can be restored by increasing the 
contribution of other factors (hormone 
therapy). Many combinations of this 
kind of experiment should be carried 
out with different kinds of motivated
behavior. 
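
The additive rule stated in hypothesis 11 can be illustrated with a minimal sketch in Python. The factor names and magnitudes below are hypothetical, in arbitrary units, and are not taken from any experiment. The sketch only shows how, under a strictly additive rule, the loss of one excitatory contribution (cortical) could be offset by raising another (hormonal).

```python
def net_excitability(excitatory, inhibitory):
    """Sum all excitatory influences and subtract all inhibitory ones."""
    return sum(excitatory.values()) - sum(inhibitory.values())


# Hypothetical contributions, arbitrary units (illustrative only).
inhibitory = {"satiation_center": 1.0}
intact = {"sensory": 3.0, "hormonal": 2.0, "cortical": 2.0}

# Cortical lesion: the cortical contribution drops out.
lesioned = dict(intact, cortical=0.0)

# Hormone therapy: the hormonal contribution is raised to compensate.
treated = dict(lesioned, hormonal=4.0)

level_intact = net_excitability(intact, inhibitory)
level_lesioned = net_excitability(lesioned, inhibitory)
level_treated = net_excitability(treated, inhibitory)

print(level_intact, level_lesioned, level_treated)
```

If the factors did not combine additively, the same loss could not be made good simply by increasing a different factor, which is why combination experiments of this kind bear on the hypothesis.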

A number of the limitations and some 
of the advantages of the present theo- 
retical approach to the physiology of 
motivation are discussed. 



(Received February 26, 1953) 





Psychological Review 
Vol. 61, No. 1, 1954 


THE S-R REINFORCEMENT THEORY OF EXTINCTION 


HENRY GLEITMAN, JACK NACHMIAS,¹ AND ULRIC NEISSER²


Swarthmore College 


Stimulus-response reinforcement
theory as formulated by Hull (10), 
the most highly developed of current 
learning theories, has been the center 
of much debate and controversy. Its 
view of reinforcement has been chal- 
lenged by the latent-learning studies 
(1, 32), its conception of the response 
has been attacked by place-learning 
experiments (27, 33), and its analysis 
of discrimination learning has been 
repeatedly questioned, both by the 
adherents of noncontinuity theories 
(16, 18), and more recently by other 
writers (28). Comparatively little 
attention, however, has been paid to 
its theory of extinction. 

This omission is regrettable, in view 
of the fact that extinction constitutes 
a strategic area for any learning 
theory. In the first place, it repre- 
sents an important phenomenon which 
every theory must at least attempt to 
explain—adaptive behavior presup- 
poses not only the acquisition of ap- 
propriate new responses, but also the 
abandonment of inappropriate old 
ones. Furthermore, theoretical inter- 
pretations of extinction play an im- 
portant part in the explanation of 
other phenomena; thus most S-R 
theorists consider discrimination 
learning to be the result of an inter-
action between excitatory and inhibi-
tory tendencies.

This paper³ will examine the S-R
reinforcement theory of extinction,
and will try to show that it suffers
from some serious shortcomings.
Specifically, we believe that Hull's
theory of extinction does not fit all
the experimental facts, involves cer-
tain conceptual difficulties, and gen-
erates some paradoxical predictions.

¹ Now at Harvard University.

² Now at Massachusetts Institute of Technology.

³ We wish to express our appreciation to Dr.
Edward Walker for his helpful suggestions and
criticisms.


HULL’S THEORY OF EXTINCTION


Following Hilgard and Marquis (7), 
theories of extinction can be grouped 
into two general categories: interfer- 
ence and adaptation. Interference 
theories, such as Guthrie’s (6) and 
Wendt’s (34), assert that extinction is 
due to the association of interfering 
responses to the conditioned stimulus. 
Adaptation theories, such as Hull’s 
theory of reactive inhibition, assume 
that extinction is caused by an inhibi- 
tory factor generated by the repeated 
elicitation of the response. This in- 
hibitory factor—believed to be analo- 
gous to fatigue—is said to act against 
the further evocation of the response, 
and is usually thought to dissipate 
with time. 

Razran (26) and Hilgard and Mar- 
quis have shown that neither of these 
theories by itself provides an adequate 
explanation of the phenomena of ex- 
tinction. Interference theories fail to 
indicate how the interfering responses 
arise in the first place. They do not 
account for spontaneous recovery, 
although recent attempts in that 
direction have been made by Liber- 
man (19, 20). They are further 
challenged by certain facts concerning 
the rates of conditioning and extinc- 
tion. If extinction were but a mani- 
festation of the conditioning of inter- 
fering responses, then any factor that 
facilitates conditioning should like- 





24 HENRY GLEITMAN, JACK NACHMIAS, AND ULRIC NEISSER 


wise accelerate extinction. In actual 
fact, stimulants increase the rate of 
conditioning and retard extinction 
while depressants retard conditioning 
but accelerate extinction. The nega- 
tive correlation usually found between 
rates of conditioning and rate of 
extinction likewise argues against an 
interference theory (7, p. 119). 

An adaptation theory alone is also 
inadequate. It fails to account for 
the fact that spontaneous recovery is 
usually incomplete, and that repeated 
extinction sessions eventually lead to 
a total lack of recovery. It does not 
explain the stimulus generalization of 
extinction effects nor the phenomenon 
of disinhibition. 

Hull’s theory of extinction (10),⁴
like that presented by Miller and 
Dollard (24), utilizes both interference 
and adaptation concepts and thus has 
a considerably expanded scope. It 
first postulates the operation of an 
inhibitory factor, reactive inhibition 
or $I_R$, which tends to counteract the
further occurrence of the response. 
This factor is assumed to result from 
the elicitation of the response itself, to 
vary with the effort involved in the 
performance of that response, and to 
decay with time. On this basis, Hull 
deduces a variety of phenomena such 
as spontaneous recovery, the superi-
ority of distributed over massed
practice, and reminiscence.

In addition to mere effector inhibi-
tion ($I_R$), Hull also postulates that
extinction involves the production of
a habit, conditioned inhibition or ${}_SI_R$,
a habit of not responding. Its
origin is explained as follows:


⁴ Recently, Hull’s systematic formulations
have been revised and elaborated in Essentials 
of Behavior (11), and in A Behavior System 
(12). Since we feel that these more recent 
publications have left the theory of extinction 
essentially unaltered, we shall base our 
discussion primarily upon the more familiar 
Principles of Behavior. 


“. . . the after-effects of response evocation
in the aggregate constitute a negative drive 
strongly akin to tissue injury or “pain.” If 
this is the case, we should expect that the 
cessation of the “nocuous” stimulation in 
question or the reduction in the inhibitory 
substance, or both, would constitute a rein- 
forcing state of affairs. The response process 
which would be most closely associated with 
such a reinforcing state of affairs would 
obviously be the cessation of the activity 
itself. In accordance with the “law of rein- 
forcement”... this cessation of activity 
would be conditioned to any afferent stimulus 
impulse, or stimulus traces, which chanced to 
be present at the time the need decrement 
occurred. Consequently there would arise 
the somewhat paradoxical phenomenon of a 
negative habit, i.e., a habit of not doing 
something” (10, p. 282). 


Being a habit, ${}_SI_R$ is tied to a
stimulus, and presumably does not 
dissipate with time. It can thus be 
invoked to explain the generalization 
of extinction along stimulus dimen- 
sions, disinhibition, and the incom- 
pleteness of spontaneous recovery. 

Both inhibitory factors, ${}_SI_R$ and $I_R$,
contribute to the extinction process by
summating to make up an inhibitory
aggregate $\dot{I}_R$, which is subtracted from
reaction potential, ${}_SE_R$, to yield
effective reaction potential, ${}_S\bar{E}_R$.
This relation is expressed by the
following equations:

$$\dot{I}_R = {}_SI_R + I_R,$$

$${}_S\bar{E}_R = {}_SE_R - \dot{I}_R.$$


It is important to note that, accord-
ing to Hull, both ${}_SI_R$ and $I_R$ are
produced during rewarded as well as 
during unrewarded trials; the rise of 
the learning curve during conditioning 
only means that each response leads 
to a greater increment of reaction 
potential than of the inhibitory ag- 
gregate. 
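
As an illustration only, the two relations can be written out in a short Python sketch. The numerical values are hypothetical and in arbitrary units, and the function and variable names are introduced here for illustration; they are not Hull's. The second computation assumes that after a long rest reactive inhibition has dissipated completely while conditioned inhibition is unchanged, which is how the theory accounts for incomplete spontaneous recovery.

```python
def inhibitory_aggregate(conditioned_inhibition, reactive_inhibition):
    """Inhibitory aggregate: conditioned plus reactive inhibition."""
    return conditioned_inhibition + reactive_inhibition


def effective_reaction_potential(reaction_potential,
                                 conditioned_inhibition,
                                 reactive_inhibition):
    """Reaction potential minus the inhibitory aggregate."""
    return reaction_potential - inhibitory_aggregate(
        conditioned_inhibition, reactive_inhibition)


# Hypothetical values, arbitrary units (illustrative only).
reaction_potential = 10.0
conditioned_inhibition = 3.0   # habit-like; assumed not to decay with time
reactive_inhibition = 5.0      # fatigue-like; assumed to decay with rest

# At the end of an extinction session both inhibitory factors are present.
end_of_session = effective_reaction_potential(
    reaction_potential, conditioned_inhibition, reactive_inhibition)

# After a long rest, reactive inhibition is assumed to have fully dissipated.
after_rest = effective_reaction_potential(
    reaction_potential, conditioned_inhibition, 0.0)

print(end_of_session, after_rest)
```

On these assumptions the response recovers with rest, but only to the level left after subtracting conditioned inhibition, never to the full reaction potential.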

On the surface, Hull’s theory of 
extinction seems to account for many 
of the facts with considerable elegance. 
Nevertheless, we believe that his con- 
ception of the extinctive process is 
beset by serious problems. We shall
discuss these problems under three
headings, in what we feel is the order
of increasing importance: (a) em-
pirical difficulties, (b) conceptual diffi-
culties, and (c) some paradoxical
predictions generated by the theory.


EMPIRICAL DIFFICULTIES 


According to Hull’s theory of ex- 
tinction, the elimination of a response 
presupposes the performance of the 
response to be eliminated, or at least 
the performance of another response 
from which extinction effects can 
generalize. For both reactive and 
conditioned inhibition depend upon 
response performance, the former 
directly, and the latter indirectly 
through its dependence on $I_R$ reduc-
tion. There are some experimental
findings, however, which at least sug- 
gest that the performance of an 
activity is not a necessary condition 
for its extinction. 

1. Subzero extinction. Evidence for 
such a possibility comes first from the 
phenomenon of “subzero extinction,”
demonstrated in classical condition-
ing. Pavlov (25) showed that when
a conditioned response has been ex-
tinguished to the point of nonelicita-
tion, further unreinforced presenta-
tions of the conditioned stimulus will
nevertheless serve to strengthen ex- 
tinction, as measured by a decrease in 
spontaneous recovery. Similar re- 
sults were obtained by Brogden, Lip- 
man, and Culler (2). 

It might be argued that these 
effects are the result of the extinction 
of covert, implicit responses, which 
were elicited even when the overt ones 
were absent. Such an interpretation 
is consonant with the findings of 
Brogden, Lipman, and Culler (2) that 
slight forelimb movements did persist 
into the subzero extinction trials. 
This implies that crucial implicit
responses or $r_g$’s survived the elimina-
tion of the overt, “parent” responses,
and that they are thus more resistant 
to extinction than are the latter. 

2. Latent extinction. Further evi- 
dence comes from studies reporting an 
effect which might be called “latent
extinction” by analogy with the phe- 
nomenon of latent learning. There 
have recently been three experiments 
in this area. 

Seward and Levy (29) trained rats 
to run a straight alley with food on the 
goal platform. Subsequently, the 
animals were extinguished in two 
different ways: The experimental 
group was detained on the now empty 
goal platform both before and between 
extinction trials, whereas the control 
group spent equivalent periods on a 
neutral empty platform. The experi- 
mental group reached the extinction 
criterion in significantly fewer trials, 
and ran more slowly than the control 
group. The effect of previous deten-
tion on the empty goal platform ap-
peared even on the very first extinc-
tion trial: compared with training 
there was a significant decrease in 
running speed for the experimental
animals after such treatment, whereas 
the corresponding decrease for the 
control animals was not significant. 
This suggests that an instrumental 
response can be extinguished without 
being elicited. 

Bugelski, Coyer, and Rogers (3) 
took issue with the experimental de- 
sign employed by Seward and Levy, 
pointing out that their experimental 
and control animals were detained on 
different platforms even during the 
extinction procedure (between trials), 
so that the test situation was not 
identical for both groups. (We don’t 
entirely understand this objection, 
since Seward and Levy had already 
found a significant difference between 
the running times of their two groups 
on the first test trial.) Upon repeat-
ing the experiment, Bugelski, Coyer,
and Rogers failed to obtain any evi- 
dence of latent extinction. There is 
some doubt, however, whether the 
repetition really duplicated the condi- 
tions of the earlier experiment, since 
even the control animals used by 
Seward and Levy gave up running 
sooner than did those employed in the 
replication. Bugelski, Coyer, and 
Rogers suggest that this difference 
may be due to an age factor; the rats 
used in their experiment were younger 
and may have been more active. 

Latent extinction, however, was
also obtained in an experiment by
Deese (4). He trained rats to run to
one side of a U maze, and afterwards
was able to extinguish the correct
choice response by merely placing the
animals in the goal box without food.
Animals who were subjected to this
nonresponse extinction procedure
made a smaller proportion of correct
choices when again run in the maze
than did control animals who were not
permitted to “inspect” the empty
goal box. Thus, again some extinc-
tion occurred without the prior per-
formance of the response to be ex-
tinguished, and therefore did not seem
to depend upon response-produced
inhibition. In fact, nonresponse ex-
tinction was just about as effective as
response extinction in producing the
abandonment of the correct response.
Comparing the results on four ordi-
nary, nonreinforced response trials
that were preceded in one group by
four nonresponse extinction trials, and
in the other group by four response
extinction trials, we find hardly any
difference. In other words, being
placed in an empty goal box four times
seems just as effective in reducing the
likelihood of running subsequently as
actually having run there on four
occasions.

This conclusion is based upon the
two groups which had eight consecu-
tive extinction trials on the same day. 
The effect is even more striking with 
groups which were given a 24-hr. rest 
interval between the first four and 
second four extinction trials. In these 
groups, four exposures to the empty 
goal box led to a greater decrement in 
performance, measured on the second 
day, than four nonreinforced runs. 
In other words, response extinction 
showed the effects of spontaneous 
recovery, while “nonresponse” ex-
tinction did not.

It might be asserted that these 
results can be explained in a manner 
similar to that in which Spence (31) 
and others (22) have attempted to 
deal with latent learning; that is, by 
reference to fractional anticipatory 
goal responses ($r_g$’s) to whose sen-
sory consequences the turning or
running responses had become condi- 
tioned during training, and which 
have become extinguished during the 
latent-extinction period. Such an ex- 
planation does not seem plausible. 

Since the $r_g$ is an implicit response,
it presumably requires very little 
effort. It follows that many trials 
should be required for its extinction. 
As we have seen, this deduction is 
confirmed by the data from subzero 
extinction, at least if these are to be 
explained by the supposed extinction 
of implicit responses. Yet Deese’s 
animals, given only four trials in the 
empty goal box, nevertheless showed 
extinction effects equal in magnitude 
to those of animals which were re- 
quired to actually run to the goal box. 
If his results are also ascribed to the 
ubiquitous $r_g$, this now possesses
somewhat contradictory properties. 
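
The deduction can be made concrete with a toy calculation. The sketch assumes, purely for illustration, that each unreinforced elicitation adds inhibition in direct proportion to the effort of the response, and that extinction is reached once the accumulated inhibition equals the reaction potential. The linear rule, the parameter names, and all numbers are hypothetical and are not part of Hull's formal system.

```python
import math


def trials_to_extinction(reaction_potential, effort,
                         inhibition_per_unit_effort=1.0):
    """Count unreinforced trials needed for accumulated inhibition to
    reach the reaction potential, assuming a fixed increment per trial
    that is proportional to effort (an illustrative assumption)."""
    if effort <= 0:
        raise ValueError("effort must be positive")
    increment = effort * inhibition_per_unit_effort
    return math.ceil(reaction_potential / increment)


# Hypothetical values, arbitrary units.
overt_run_trials = trials_to_extinction(10.0, effort=2.5)
implicit_response_trials = trials_to_extinction(10.0, effort=0.25)

print(overt_run_trials, implicit_response_trials)
```

On these assumptions a low-effort implicit response needs many more trials than an effortful overt run, which is the property that sits uneasily with extinction after only four placements in the empty goal box.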

The preceding discussion has not 
exhausted the empirical difficulties 
encountered by Hull’s theory of ex- 
tinction. To give merely two ex- 
amples, the roles played by effort and
by spacing in learning and extinction
are by no means clear. For reasons
of brevity, we have confined ourselves
to the discussion of what we consider 
the most central empirical question: 
Is performance a necessary condition 
for the extinction of a response? 


CONCEPTUAL DIFFICULTIES 


The S-R reinforcement theory of 
extinction has shortcomings more 
serious than the empirical problems 
discussed above. These shortcomings 
become apparent when we try to 
discover how the theory’s central 
constructs are conceptualized. We 
will confine ourselves here to a discus- 
sion of the conditioned inhibition con- 
struct, which seems to pose the most 
serious problems. In so doing, how- 
ever, we do not wish to minimize the 
difficulties involved in the notion of 
reactive inhibition. The latter is usu- 
ally discussed (10, 30) as if it were a 
result of proprioceptive stimulation 
from the specific effectors involved in 
the response, yet the learned response 
is often defined, not in terms of 
specific effectors, but in the broader 
terms demanded by the results of 
place learning (27) and response
generalization (18) experiments. 
Thus, Miller and Dollard define the 
response as “any activity within the 
individual which can become func- 
tionally connected with an antecedent 
event through learning” (24, p. 59). 
A related problem arises from the 
results of Gustafson and Irion (21, 
p. 174) and Kimble (14), who have 
shown a clear reminiscence effect in 
bilateral transfer. They point out 
that if reminiscence is to be ascribed
to the dissipation of Ir, then Ir must
inhibit more than a particular, specific
response. We refrain from extended
discussion of these matters, however,
because we feel that the difficulties
can probably be overcome by clarification
of the definitions involved.

The concept of conditioned inhibition requires more thorough
consideration. Originally sIr, the “habit of not responding,” was
proposed by Hull to account for the more stable aspects of extinction,
and for the stimulus generalization of extinction effects. The act of
not responding, from here on referred to as the “not-response,” is
connected to the stimulus situation by the reinforcing effects of Ir
reduction. The not-response is treated as formally equivalent to an
ordinary response, so that sIr is really an S-not-R bond; thus the laws
of habit formation are widened to include extinction.

In order to understand what might be meant by a “habit of not
responding,” we must first be clear on the meaning of the not-response
itself. This consideration is all the more appropriate since the
postulation of not-responses that have the same status as ordinary
responses has become increasingly widespread in S-R reinforcement
theory. For instance, Dollard and Miller (5, p. 202), in order to
subsume repression under their general theory of anxiety learning,
speak of a “response of stopping thinking,” reinforced by anxiety
reduction.

Unfortunately, S-R theorists are 
somewhat vague in discussing the 
nature of the not-response. Some- 
times Hull seems to identify it with 
the absence of activity, at other times 
with the cessation of activity, and on 
still other occasions he seems to 
assert that sIr is Ir conditioned to the
stimulus situation (all italics ours): 


Consequently there would arise the somewhat paradoxical phenomenon of
a negative habit; i.e., a habit of not doing something (10, p. 282).

Stimuli and stimulus traces closely associated with the cessation of a
given activity, and in the presence of Ir from that response, become
conditioned to this particular non-activity . . . (11, p. 75).

The organic process most closely preceding the drive reduction would
be the cessation of the activity itself (11, p. 75).

. . . this cessation of activity would become conditioned . . .
(10, p. 282).

Stimuli closely associated with the acquisition and accumulation of
inhibitory potential (Ir) become conditioned to it . . . (10, p. 282).


Miller and Dollard refer to a tendency 
to stop an activity: 

... Thus muscle strain and fatigue are 
drives constantly motivating the subject to 
stop the response he is making; escape from 
muscle strain and fatigue are ever present to 
reward stopping. Extinction occurs unless 
the effects of the drive of fatigue and con- 
sequent reward for stopping are overridden 
by the effects of other stronger drives and 
rewards (24, p. 40). 


From the above array of quotations, no clear indication emerges as to
just what it is that gets conditioned to the stimulus in sIr. However,
these statements do suggest a relatively small number of alternatives.
The not-response (that which gets conditioned to S in sIr) is either:
(a) the absence of a particular activity; (b) the inhibition
(interruption or cessation) of a response already in progress; or
(c) Ir conditioned to the stimulus as a learned drive. We shall now
examine each of these alternatives.


1. The not-response is the absence of a particular activity. According
to this alternative, the sheer absence of a response is that which by
Ir reduction will be associated with the stimulus. This conception is
completely untenable. For, in this sense of not-responding, the animal
is performing innumerable and indistinguishable not-responses all the
time. Simultaneously with not pressing the lever in a Skinner box, he
is also not running a maze, not jumping to a black card, and not
playing three-dimensional chess. As a matter of fact, the same infinite
set of not-responses is also performed when he is pressing the lever.
Since all of these not-responses occur at the time of Ir reduction and
of reward, they should all be conditioned to the stimulus. This is
clearly absurd.

2. The not-response is the inhibition (interruption or cessation) of a
response already in progress. This alternative seems to be the one most
frequently implied by S-R theorists.
Here it is asserted that before the 
not-response can be evoked, the 
response proper must at least have 
begun; that is, the animal always 
starts to press the lever before he 
stops doing so. But, in extinction he 
eventually fails to respond altogether, 
and does not even start to make the 
response, at least overtly. Of course, 
one might again suggest that when no 
overt response is started, there is at 
least an implicit one present, so that 
the conditions for the elicitation of the 
not-response as here conceived are 
met. Such a conception, however, 
raises another problem. 

If implicit responses are to be utilized in S-R reinforcement theory,
many phenomena suggest that these responses must be capable of being
extinguished. As we have already seen, the latent-extinction results
suggest such an interpretation. Furthermore, Hull’s recent treatment of
secondary reinforcement (11) deals with this in terms of fractional
responses. Since secondary rewards can be extinguished, the r_g’s must
be capable of extinction. Finally, we believe that Hull’s theory of
problem solving (9) can be shown to require the possibility of
extinguishing implicit responses. Thus, within the context of S-R
reinforcement theory, r_g’s must be extinguishable, and since they are
conceived as formally akin to overt responses, their extinction must
follow the same laws as those proposed for the latter.

How could such extinction take
place? There would seem to be a
need for a not-r_g to counteract the r_g.
But, according to the present alternative,
such a not-r_g can only be evoked
after the r_g has been initiated. Once
again, the complete elimination of the 
(implicit) response must be explained, 
not its interruption. Since we are 
already at the level of implicit re- 
sponses, an even more implicit re- 
sponse would have to be set in motion 
for the purpose. This is an utterly 
unpalatable concept. 

The present alternative, then, seems to be unsatisfactory. In order to
explain the total elimination of responses, it must resort to the
postulation of implicit responses. When called upon to account for the
extinction of implicit responses, it becomes yet more strained.

3. The not-response is reactive inhibition conditioned to the stimulus.
Hull (10) sometimes writes of sIr as Ir which has been conditioned to a
stimulus. This implies that Ir and sIr are of the same nature, except
that in the first case the inhibitory force is produced as the direct
result of effector action, while in the second case the identical force
is elicited by a conditioned stimulus.

Such an interpretation is formally 
similar to the theory of fear behavior 
suggested by Miller (23). He as- 
sumes that fear is an internal response, 
reflexly connected with pain, which 
can be conditioned to an originally 
neutral stimulus under suitable conditions. Kimble (13) has criticized
the interpretation of sIr in such terms. He argues that Ir should be
treated as an intervening variable, and that responses, rather than
intervening variables, become connected to stimuli.

Kimble’s criticism may not be a crucial one. S-R theorists have rarely
hesitated to endow their intervening variables with appropriate
properties, and could perhaps treat Ir as if it were a response (or
rather, a not-response), which can be conditioned to stimuli to
generate sIr. Even this formulation, however, raises some problems. If
sIr is thought of as a conditioned Ir-response, then sIr and Ir must
have identical response properties. From this point of view, the
inhibitory processes involved in sIr and Ir must be the same in all
respects save the manner in which they are aroused.

Thus, the dissipation of inhibition 
following the withdrawal of a condi- 
tioned inhibitor must follow the same 
temporal course as the dissipation of 
Ir, the response generalization of
sIr and Ir must be equivalent, and so
on. We do not know whether S-R 
reinforcement theorists would be pre- 
pared to accept these consequences of 
the present alternative. 

There remains yet a fourth possi- 
bility, that of equating the not- 
response with an actual activity 
antagonistic to the to-be-extinguished 
activity, i.e., making the not-response
a bona fide response. The resulting 
theory of extinction would be rather 
close to the interference theories of 
Guthrie (7) and Wendt (34), and at 
the very least would have to face 
many of the objections that have been 
leveled against these. In the absence 
of any evidence that Hull and his co- 
workers had this possibility in mind, 
it will not be considered here. 


PARADOXICAL DERIVATIONS 
FROM THE THEORY 


We have tried to show that the S-R reinforcement theory of extinction
encounters serious empirical problems, and contains some important
conceptual difficulties. One might
argue that, despite their shortcomings, 
the postulates of the theory permit us 
to deduce a great number of phe- 
nomena of extinction which actually 
occur. Unfortunately, however, they 
also necessitate certain other pre- 
dictions which are clearly false. 


1. Predictions regarding the course of learning and extinction. Hull
and his co-workers believe that habit strength becomes asymptotic to a
maximum value, and they usually assume that it does not decay with
time. Furthermore, they assert that Ir and sIr result as a necessary
consequence of the evocation of the response, regardless of the
presence or absence of positive reinforcement. Withholding
reinforcement leads to extinction only indirectly; when no further
increase in reaction potential occurs, the inhibitory actions of Ir and
sIr grow unopposed.

From these assumptions it follows that the ordinary learning curve
should not be monotonically increasing, but should instead rise to a
maximum and then eventually return to the base line.⁵ For, as the habit
is repeatedly reinforced, sEr approaches its asymptote. Once this
asymptote is approximated, further reinforcements cannot add any
further effective increments to the habit strength. Only Ir and sIr can
then be generated to any extent. (That sIr is not yet at its asymptote
is obvious: since extinction has not yet occurred, sIr must be capable
of further growth.) This means that from here on, further
reinforcements can only lead to a decrement in performance, and will
eventually cause the total elimination of the response. A pause between
trials may at first lead to some recovery due to Ir dissipation, but
this recovery will be short lived. Further trials must add to sIr until
it is approximately equal to sEr, at which point no more recovery can
take place. The learning curve will have reached the base line, never
to come up again. Necessarily, then, there is no learned act which can
be performed for any length of time; its very repetition, regardless of
reinforcement, must lead to its eventual elimination.

⁵ Koch (15) notes this point in his review of Hull’s Principles of
Behavior, but does not seem to regard it as more than a matter of
detail. The same problem is recognized by McGeoch and Irion (21,
p. 55). They suggest that the situation could be remedied by making Ir
subtract from N (the number of reinforced trials) rather than from sEr.
In effect, this would make Ir subtract from sHr. In Hull’s system,
however, the indestructibility of sHr and the merely “masking” roles of
Ir and sIr are essential; for example, they are crucial for the
derivation of such phenomena as spontaneous recovery, reminiscence, and
disinhibition. The suggestion made by McGeoch and Irion thus amounts to
a proposal for a radically revised theory, and is not specifically
relevant to the present discussion.
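
The deduction above can be made concrete with a small numerical sketch. It is only an illustration under assumptions that are not in Hull's text: drive is held constant so that reaction potential equals habit strength, habit strength grows by a fixed fraction of its distance from the asymptote on each reinforced response, each response adds a fixed amount to Ir and to sIr, and a fixed fraction of Ir survives the rest between trials. The function name and every parameter value are hypothetical.

```python
def simulate_learning_curve(trials=400, h_start=1.0, h_max=100.0, h_rate=0.1,
                            ir_per_response=2.0, ir_retained=0.5,
                            sir_per_response=0.5):
    """Return the net response strength at the start of each trial.

    All parameter values are illustrative assumptions, not Hull's constants.
    """
    habit = h_start       # sHr: grows toward h_max and never decays
    reactive = 0.0        # Ir: added by each response, dissipates with rest
    conditioned = 0.0     # sIr: added by each response, never decays
    curve = []
    for _ in range(trials):
        # Inhibition only masks habit strength; it does not destroy it.
        net = max(0.0, habit - reactive - conditioned)
        curve.append(net)
        if net > 0.0:
            # The response is evoked and reinforced on this trial.
            habit += h_rate * (h_max - habit)
            reactive += ir_per_response
            conditioned += sir_per_response
        # Rest between trials: only a fraction of Ir survives.
        reactive *= ir_retained
    return curve


if __name__ == "__main__":
    curve = simulate_learning_curve()
    peak_trial = max(range(len(curve)), key=curve.__getitem__)
    print(f"peak strength {curve[peak_trial]:.1f} on trial {peak_trial}")
    print(f"strength on final trial: {curve[-1]:.1f}")
```

Under these assumptions the curve rises while habit increments exceed the fixed inhibitory increments, shows brief recoveries after rests once Ir has dissipated, and then stays at the base line as soon as the accumulated sIr matches the asymptotic habit strength, even though every response was reinforced.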

This prediction is at odds with 
everything we know about the course 
of learning. The phenomenon of 
‘inhibition of reinforcement’ (8) oc- 
curs only under quite special condi- 
tions, and hardly begins to do justice 
to this deduction. The learning curve must return to zero, regardless
of the spacing of trials, and must do so in the same number of trials
required for experimental extinction after sEr has reached asymptote.
One does not have to refer to experimental studies to demonstrate the
fallacy of this prediction. Our daily life is full of countless
activities which we perform again and again with no sign of decrement.
We turn door knobs, say “how do you do,” sit down on chairs, and
recline on beds, and have done so since childhood. It is reasonable to
assume that such habits have reached asymptotic strength at an early
age, yet there is no sign of decline.

By the same reasoning, it also 
follows that once a habit has been 
completely extinguished, recondition- 
ing is impossible. For again, assum- 
ing that the habit strength was at 
asymptote prior to extinction, further 
reinforcement—no matter how fre- 
quent or how spaced—cannot add to 
it. This also is contrary to experi- 
mental fact (2) and to common 
observation. 

The deductions just developed lead 
one to suspect that there is some 
serious flaw in the postulates which 
generated them. It seems to us that 
the problems principally derive from 
the assumption that there is no 
qualitative difference between the 
learning and the extinction situations, 
and that nonreinforcement affects performance merely by preventing the
further growth of habit strength. According to the theory, an
extinction trial is but a learning trial without reward (or rather with
decreased reward), since Ir reduction still furnishes some
reinforcement. Withdrawal of reward produces no real change in the
situation. Ir and sIr are generated during learning as well as during
extinction. We shall now try to show that this conception leads to yet
further paradoxes.

2. Predictions regarding the impossibility of either learning or
extinction. In the theory of extinction originally proposed by Hull,
conditioned inhibition is a habit established by reinforcement due to
Ir reduction. Whenever an animal performs a response, a not-response
inevitably follows it. Just how the not-response is conceived is
irrelevant, so long as it inevitably occurs subsequent to the bona fide
response. During the performance of the not-response, Ir dissipates.
This results in need reduction, and in turn reinforces the connection
between the stimulus situation and the not-response. Thus sIr is built
up. The not-response opposes the response, eventually leading to
extinction.

If we accept this mechanism, we are 
faced with an unpleasant dilemma: 

a. Extinction is impossible. Before not-responding, the animal must
necessarily have responded. If Ir reduction is reinforcing, it should
reinforce the response as well as the not-response. If it did, and to
the same degree, extinction could not take place.

In discussing this problem, Hull (10, p. 301) refers to the gradient of
reinforcement. He points out that the not-response is temporally more
contiguous with the decrease in “nocuous” stimulation, and thus to
reinforcement, than is the bona fide response. In consequence, the
former should be stamped in more strongly than the latter, i.e., the
increment in sIr should exceed the increment in sHr. In this manner
extinction could take place. But this solution forces us onto the other
horn of the dilemma.

b. Learning is impossible. As we have already seen, according to the
theory, the not-response follows the response both during learning and
during extinction. After pressing the lever in a Skinner box, the
animal must necessarily stop pressing the lever (perform the
not-response). This occurs before he reaches for the food pellet. But
since the gradient of reinforcement, invoked before to make extinction
possible, applies here equally, the not-response should be conditioned
more strongly to the stimulus situation than should the response
itself. In that case, the response can never become effectively
established, since the increment in sIr must always be greater than the
increment in sHr. Any increase in the amount of reinforcement would
benefit the not-response proportionately more than the response itself.
Thus, learning is impossible.
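
The second horn of the dilemma can be illustrated with a short sketch. The exponential form of the gradient of reinforcement, the event times, and the function names below are assumptions made only for illustration; the argument needs nothing more than a gradient that falls as the delay between an act and its reinforcement grows.

```python
import math


def reinforcement_increment(magnitude, delay, decay=1.0):
    """Increment earned by an act that precedes reinforcement by `delay`.

    Assumed exponential gradient: the longer the delay, the smaller the gain.
    """
    if delay < 0:
        raise ValueError("the act must precede the reinforcement")
    return magnitude * math.exp(-decay * delay)


def habit_increments(magnitude, t_response=0.0, t_not_response=0.5,
                     t_reinforcement=1.0, decay=1.0):
    """Return (increment in sHr, increment in sIr) for a single trial.

    The not-response always falls between the response and the
    reinforcement, so its delay is necessarily the shorter one.
    """
    d_shr = reinforcement_increment(magnitude, t_reinforcement - t_response, decay)
    d_sir = reinforcement_increment(magnitude, t_reinforcement - t_not_response, decay)
    return d_shr, d_sir


if __name__ == "__main__":
    for magnitude in (0.1, 1.0, 10.0):
        d_shr, d_sir = habit_increments(magnitude)
        print(f"reward {magnitude}: sHr +{d_shr:.3f}, sIr +{d_sir:.3f}")
```

Because the reward magnitude multiplies both increments, enlarging the reward leaves their ratio unchanged; on this reading the not-response gains more than the response on every reinforced trial, whatever the size of the reward.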

Hull considers this problem also, and suggests a possible solution. He
argues that the reinforcement in many experimental situations is
secondary in nature, and that “this secondary reinforcement, e.g., the
click of the magazine, occurs during the contraction and before the
relaxation” (10, p. 302). Since reinforcement preceding a response is
relatively ineffective, Hull concludes that the response would receive
a greater benefit from the secondary reinforcement than would the
not-response, even though the former benefits less from primary
reinforcement due to Ir reduction. In this way, the not-response might
receive less total reinforcement than the response proper, and the sHr
increment might outweigh the increase in sIr.

This suggestion seems inadequate, 
for there is no reason to assume that 
the response is accompanied by more 
secondary reinforcement than is the 
not-response. The effectiveness of 
secondary reinforcers is generally be- 
lieved to be a function of their tem- 
poral proximity to primary need 
reduction. The not-response is necessarily
closer to primary reinforcement
than is the bona fide response. Hence 
the secondary reinforcement accom- 
panying it should be more, rather than 
less. The occurrence of a consistent 
click at the time of the response is 
merely an artifact of a particular 
experimental condition; surely the rat 
would learn even if the click were 
made contiguous with the not-re- 
sponse. 

We are thus left with a strange spectacle: a theory of extinction,
derived from principles of learning, which must deny either the
existence of learning or of extinction. The assumption of continuity
between the learning and extinction situations (the failure to allow
for any qualitative change brought about by withdrawal of reward)
appears less and less tenable.


SUMMARY 


Any theory of learning must deal 
with the phenomena of extinction as 
well as those of habit formation. S-R 
reinforcement theory, as presented by 
Hull, is one of the most influential of 
modern learning theories. It thus 
seemed appropriate to examine crit- 
ically his treatment of extinction. 
We have tried to show that it faces a 
number of serious difficulties. In 
particular: 


1. Recent experiments in the field 
of “latent extinction” suggest that the 
actual performance of a response may 
not be necessary for its extinction. 

2. Neither reactive nor conditioned 
inhibition is clearly or adequately 
conceptualized. In particular, the 
“habit of not responding” has never 
received a satisfactory definition. 

3. Certain paradoxical consequences
can be derived from the
theory: Not only should the learning 
curve inevitably decline to its starting 
point with continuous reinforcement, 
but, in fact, learning should be im- 
possible altogether. 

4. Many of these difficulties stem 
from Hull’s assumption that with- 
drawal of reward introduces nothing 
essentially new to the situation. 


REFERENCES 


1. BLODGETT, H. C. The effect of the introduction of reward upon the
maze performance of rats. Univer. Calif. Publ. Psychol., 1929, 4,
113-134.

2. BROGDEN, W. J., LIPMAN, E. A., & CULLER, E. The role of incentive in
conditioning and extinction. Amer. J. Psychol., 1938, 51, 109-117.

3. BUGELSKI, B. R., COYER, R. A., & ROGERS, W. A. A criticism of
pre-acquisition and pre-extinction of expectancies. J. exp. Psychol.,
1952, 44, 27-30.

4. DEESE, J. The extinction of a discrimination without performance of
the choice response. J. comp. physiol. Psychol., 1951, 44, 362-366.

5. DOLLARD, J., & MILLER, N. E. Personality and psychotherapy. New
York: McGraw-Hill, 1950.

6. GUTHRIE, E. R. The psychology of learning. New York: Harper, 1935.

7. HILGARD, E. R., & MARQUIS, D. G. Conditioning and learning. New
York: Appleton-Century, 1940.

8. HOVLAND, C. I. “Inhibition of reinforcement” and phenomena of
experimental extinction. Proc. nat. Acad. Sci., 1936, 22, 430-433.

9. HULL, C. L. The mechanism of the assembly of behavior segments in
novel combinations suitable for problem solving. Psychol. Rev., 1935,
42, 219-245.

10. HULL, C. L. Principles of behavior. New York: Appleton-Century,
1943.

11. HULL, C. L. Essentials of behavior. New Haven: Yale Univer. Press,
1951.

12. HULL, C. L. A behavior system. New Haven: Yale Univer. Press, 1952.

13. KIMBLE, G. A. Performance and reminiscence in motor learning as a
function of the degree of distribution of practice. J. exp. Psychol.,
1949, 39, 500-510.

14. KIMBLE, G. A. Transfer of work inhibition in motor learning. J.
exp. Psychol., 1952, 43, 391-392.

15. KOCH, S. Review of Hull’s Principles of behavior. Psychol. Bull.,
1944, 41, 269-286.

16. KRECHEVSKY, I. A study of the continuity of the problem solving
process. Psychol. Rev., 1938, 45, 107-133.

17. LASHLEY, K. S. Studies in cerebral functioning in learning. V. The
retention of motor habits after destruction of the so-called motor
areas in primates. Arch. Neurol. Psychiat., Chicago, 1924, 12, 249-276.

18. LASHLEY, K. S., & WADE, M. The Pavlovian theory of generalization.
Psychol. Rev., 1946, 53, 72-87.

19. LIBERMAN, A. M. The effect of interpolated activity on spontaneous
recovery from experimental extinction. J. exp. Psychol., 1944, 34,
282-301.

20. LIBERMAN, A. M. The effect of differential extinction upon
spontaneous recovery. J. exp. Psychol., 1948, 38, 722-733.

21. MCGEOCH, J. A., & IRION, A. L. The psychology of human learning.
New York: Longmans, Green, 1952.

22. MEEHL, P. E., & MACCORQUODALE, K. A further study of latent
learning in the T-maze. J. comp. physiol. Psychol., 1948, 41, 372-396.

23. MILLER, N. E. Studies of fear as an acquirable drive: I. Fear as
motivation and fear-reduction as reinforcement in the learning of new
responses. J. exp. Psychol., 1948, 38, 89-101.

24. MILLER, N. E., & DOLLARD, J. Social learning and imitation. New
Haven: Yale Univer. Press, 1941.

25. PAVLOV, I. P. Conditioned reflexes. London: Oxford Univer. Press,
1927.

26. RAZRAN, G. H. S. The nature of the extinctive process. Psychol.
Rev., 1939, 46, 264-297.

27. RITCHIE, B. F., AESCHLIMAN, B., & PEIRCE, P. Studies in spatial
learning: VIII. Place performance and the acquisition of place
dispositions. J. comp. physiol. Psychol., 1950, 43, 73-85.

28. SALDANHA, E. L., & BITTERMAN, M. E. Relational learning in the rat.
Amer. J. Psychol., 1951, 64, 37-53.

29. SEWARD, J. P., & LEVY, N. Sign learning as a factor in extinction.
J. exp. Psychol., 1949, 39, 660-668.

30. SOLOMON, R. L. The influence of work on behavior. Psychol. Bull.,
1948, 45, 1-40.

31. SPENCE, K. W. Theoretical interpretations of learning. In S. S.
Stevens (Ed.), Handbook of experimental psychology. New York: Wiley,
1951. Pp. 690-729.

32. THISTLETHWAITE, D. An experimental test of a reinforcement
interpretation of latent learning. J. comp. physiol. Psychol., 1951,
44, 431-441.

33. TOLMAN, E. C., RITCHIE, B. F., & KALISH, D. Studies in spatial
learning. I. Orientation and the short-cut. J. exp. Psychol., 1946, 36,
13-23.

34. WENDT, G. R. An interpretation of inhibition of conditioned
reflexes as competition between reaction systems. Psychol. Rev., 1936,
43, 258-281.


(Received December 10, 1952) 





Psychological Review
Vol. 61, No. 1, 1954 


PUNISHMENT: I. THE AVOIDANCE HYPOTHESIS 


JAMES A. DINSMOOR 


Indiana University 


A possible reason for the seeming 
neglect of the topic of punishment in 
contemporary behavioral research and 
in most of our handbook and textbook 
presentations may be found in the pres- 
ent entanglement of theoretical treat- 
ments. So confused is the current pic- 
ture that Stone, in a recent review of 
the literature, was led to remark that 
“The task of resolving apparently con- 
flicting results ... is an all but im- 
possible one” (32, pp. 197-198). Ac- 
tually, however, I feel that there is an 
available formulation which can handle 
the bulk of the data and which can in- 
corporate it within a more general de- 
scriptive framework without requiring 
new explanatory principles. I also be- 
lieve that this formulation can be shown 
to be consistent, at least, with those
special and seemingly contradictory instances
which appear to have been
widely cited precisely because of the 
difficulties which they offer for any 
form of systematic treatment. I am 
speaking of the proposition that the 
main effects of punishment may be at- 
tributed to the establishment of certain 
avoiding reactions which prevent the 
completion of the original behavioral 
sequence. 

The general suggestion that the ef- 
fects of punishment may be due to 
some form of interfering reaction is by 
no means a new one. It appears as 
early as 1932, in some rather incidental 
comments by Thorndike. At that time, 
Thorndike presented a summary of sev- 
eral studies which seemed to indicate 
that punishment had little or no effect 
on the preceding behavior. However, 
he recognized the necessity of providing
some kind of an “escape clause” to
deal with those special, as it seemed to
him, and anomalous cases where the 
punishment of one response did facili- 
tate the elimination of this response and 
the acquisition of a nonpunished alter- 
native. To deal with such observations 
he offered the suggestion that this ef- 
fect was due, not to a direct weakening 
of the punished response itself—as he 
had previously postulated in the law of 
effect—but to the strengthening of the 
alternative reaction. “The person or 
animal is led by the annoying after- 
effect to do something else to the situa- 
tion” (33, p. 311). Or later, “The idea 
of making [the] response or the impulse
to make it then tends to arouse a memory
of the punishment and fear, repulsion,
or shame. This is relieved by
making no response to the situation
. . . or by making a response that is
or seems opposite to the original response”
(34, p. 80).

Stemming from Thorndike’s approach, we have such later developments as
Guthrie’s contiguity interpretation (e.g., 11), Estes’ “anxiety” state
(9), and various references to “heightened tension” (13, pp. 245-246)
or to inferred drives of “fear” or “anxiety” (e.g., 8, 16) which are
said to be reduced by making an opposing or conflicting response. In
particular, several authors have at least mentioned an avoidance
interpretation, the fullest treatments being those by Dollard and
Miller (8, pp. 75-76), Mowrer (16, pp. 91, 118, 154, 210, 262 ff.; 17),
Mowrer and Kluckhohn (18, pp. 80-81), and Skinner (31, esp.
pp. 188-189); but even these are obviously rather brief. Furthermore,
no attempt has yet been made to lay a detailed and comprehensive
statement of the hypothesis alongside the published findings from
empirical studies of punishment to see how well such a statement fits
the known facts.

In this paper I will merely outline 
the hypothesis itself. First, I will re- 
view some of the empirical studies 
of secondary aversive stimulation and 
avoidance training in order to see what 
principles are required for their inter- 
pretation. Next, I will compare the 
experimental operations used in avoid- 
ance training with those used in a 
study of punishment, in a free re- 
sponding situation. Finally, I will try 
to show what should or must happen 
when we apply an aversive stimulus 
following successive instances of a given 
response. 


AVERSIVE STIMULI 


Since the concept of an aversive stim- 
ulus is fundamental to subsequent dis- 
cussion, I will begin by offering a 
definition. I have selected the word 
aversive both for the frequency of its 
appearance in the experimental litera- 
ture and for the strength of its behav- 
ioral connotations in everyday usage. 
(In Webster’s International Dictionary
[2nd Ed.], for example, aversion is first
defined as “act of turning away” and
avert is further defined as “to cause
to turn away” or “to ward off, or prevent,
the occurrence or effects of.”) I
will use the word in a strictly functional
or behavioral sense, with no reference 
to its subjective properties or to any 
assumed drive which might be said to 
be aroused or reduced by the presenta- 
tion or removal, respectively, of the 
stimulus. It will refer to a class of 
stimuli which are suitable for studies 
of “escape training” (13) or “aversion” 
(14). The critical observation is that 
the reduction or elimination of the stimulus
increases the frequency or probability
of the preceding behavioral sequence;
that is, that it is reinforcing to
the subject.

For the naive organism, this classi- 
fication apparently includes such stimu- 
lating events as immersion in water and 
certain intensities of light, sound, tem- 
perature, and electric shock. This is 
not to say, however, that an aversive 
stimulus cannot be stripped of its orig- 
inal properties or that these cannot be 
overlaid by other properties acquired 
through special training or instruction. 
In practice, most of the empirical stud- 
ies on avoidance and punishment have 
been based on the administration of 
shock to rats, although occasional ref- 
erence will be necessary to other stim- 
uli or other organisms. 


HOW NEUTRAL STIMULI BECOME
AVERSIVE 


What happens when a neutral or ineffective stimulus is paired with one
which is already aversive to the subject, such as shock? A relatively
clear and simple answer to this question may be found in an experiment
by Brown and Jacobs (2, Experiment II). The
apparatus consisted of two adjoining 
compartments, each with a shock grid 
as a floor, which were separated by a 
two-inch barrier surmounted by a guil- 
lotine-type door. In the first stage of 
the experiment, the experimental ani- 
mals (rats) were each given ten pres- 
entations of a pulsating light and tone 
paired with a pulsating shock. Each 
presentation consisted of nine seconds 
of light and tone, overlapping with a 
final six seconds of shock. No sys- 
tematic means of escape was provided 
during this stage of the experiment. 
The second step was to test the func- 
tional properties which had been ac- 
quired by the light and the tone as a 
result of their pairing with shock. 
Forty trials were given. On each trial 
the door was raised and the light and
tone were presented without further
shock. When the animal passed over 
the hurdle from one compartment to 
the other the light and tone were turned 
off and the door was lowered behind 
him. The time required for the ani- 
mal to respond was measured on suc- 
cessive trials. 

A group of control animals, which had 
not been shocked, were found to run 
somewhat less promptly from trial to 
trial. But the experimental animals 
ran more and more quickly; their la- 
tencies showed a rather sharp drop for 
the first 16 or 20 trials. Later, how- 
ever, a slight, but significant, rise ap- 
peared. The early decline in latency 
shows that the removal of the light and 
tone was a reinforcing operation which 
strengthened the response of running 
from one compartment to the other. 
The final rise in latency presumably re- 
flects the gradual loss which occurs in 
the effectiveness of secondary aversive 
stimulation when it is no longer paired 
with the primary stimulus. 

An attempt has been made by Bar- 
low (1) to specify more exactly what 
is the necessary temporal relation
between the primary and secondary
stimuli, and a related study has been
conducted in avoidance training by Mowrer
and Suter (16, pp. 280 ff.). These 
studies both suggest that the critical 
relationship is between some phase of 
the secondary stimulus and, more spe- 
cifically, the beginning or onset of the 
primary stimulus, such as shock. Simi- 
larly, the effects of presenting the sec- 
ondary stimulus without the accom- 
panying shock have been isolated and 
separately investigated in experiments 
by Schoenfeld and Antonitis (25) and 
by Page and Hall (23). These studies 
indicate that such a stimulus loses its 
aversive character when it is no longer 
paired with the primary stimulus. 

On the basis of several replications, 
then, the main fact seems to be reliably
established: that a neutral stimulus
which is presented just prior to or over- 
lapping with the administration of a 
primary aversive stimulus, like shock, 
acquires an aversive property in its own 
right and becomes what we may call a 
conditioned or secondary aversive stim- 
ulus. When we try to make use of this 
stimulus as a reinforcing agent, how- 
ever, a difficulty arises. The reinforc- 
ing operation—terminating the stimulus 
without shock—is incompatible with the 
establishing and maintaining operation, 
pairing the stimulus with shock. When 
it is terminated, therefore, without being 
paired with the primary stimulus, our 
secondary stimulus gradually loses its
effectiveness. The temporary nature of 
the secondary aversive property might 
seem to limit the role which these stim- 
uli can play in the maintenance of be- 
havior over an extended period of time. 
The difficulty is readily resolved, how- 
ever, if the pairing is restored when- 
ever the stimulus is weakened and the 
animal fails to respond within an arbi- 
trary time limit. This is the basic para- 
digm for what is known as avoidance 
training. 


HOW AVOIDING REACTIONS ARE
MAINTAINED 


In studies like those we have just been 
considering, two separate and distinct 
operations have been employed in suc- 
cessive phases of the experimental pro- 
cedure: (a) the secondary stimulus is 
paired with the primary stimulus, and 
(b) the termination of the secondary
stimulus is used to reinforce a selected 
response. In a simple and relatively 
effective form of avoidance training, 
these two procedures are interspersed or 
interwoven. At the beginning of each 
trial a secondary stimulus or “warning 
signal” is presented by the experimenter. 
If the animal makes the required
response the signal is terminated; but
when the animal fails to respond within
an arbitrary time limit, the primary 
stimulus is applied. 

As an example of this form of training,
let us take “Group III-On-Run”
from an experiment by Mowrer and 
Lamoreaux (16, pp. 126 ff.; 20). The 
warning signal was a change in the pat- 
tern of illumination produced by turning 
on two overhead lamps and turning off 
a single lamp beneath the grid. Five 
seconds of grace were allowed in which 
the rat could run to the opposite end of 
the alley or shuttle box. If he did this, 
the signal was terminated, or changed 
back, and no shock was applied; if he 
did not, the stimulus was followed by 
two seconds of shock. On the first day 
of training, the two animals in this par- 
ticular subgroup made the response only 
twice apiece in 10 trials; but on the 
eleventh and twelfth days, both animals 
ran on all 10 trials. Thus, these animals 
learned to run to the opposite end of the 
apparatus when the only direct
reinforcement for this act was provided by
the change from a pattern of stimulation
that was otherwise followed by
shock. The results were similar for
other subgroups. Although the proce- 
dure which is used in avoidance train- 
ing is relatively complicated, the gen- 
eral effects of this procedure can be 
predicted from a study of the way in 
which neutral stimuli are made aversive 
or stripped of their aversive character. 
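
The contingency on a single trial of this procedure may be sketched in code. The following is a minimal illustration in Python, assuming the five-second grace period and the two-second shock reported above. The function name, the representation of a trial by a single latency, and the treatment of a response at exactly five seconds as successful are assumptions made for the sketch; they are not taken from the original report.

```python
GRACE_SECONDS = 5.0  # time allowed for the run, as reported in the text
SHOCK_SECONDS = 2.0  # duration of shock after a failure, as reported in the text


def run_trial(latency):
    """Return the outcome of one signalled avoidance trial.

    latency: seconds from signal onset to the run, or None if the
    animal does not run at all (a hypothetical representation).
    """
    if latency is not None and latency <= GRACE_SECONDS:
        # Response within the grace period: the signal is terminated
        # and no shock is applied.
        return {"avoided": True, "shock_seconds": 0.0}
    # No response in time: the signal is followed by shock.
    return {"avoided": False, "shock_seconds": SHOCK_SECONDS}
```

On this sketch a latency of 3 seconds gives an avoided trial with no shock, whereas a latency of 6 seconds, or no response at all, gives two seconds of shock.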

If the conditioning of avoiding re- 
sponses is based on the termination of 
secondary stimuli, and if the effective- 
ness of these secondary stimuli is based 
on their pairing with the shock, it fol- 
lows that some limit must be set on the 
frequency with which the avoiding re- 
sponse will be made. When the animal 
makes the required response for several 
trials in succession, the pairing opera- 
tion is interrupted and the effects of 
successive stimulus-terminations should
dwindle. That this is indeed the case
is suggested by more detailed observa- 
tions of the subjects’ behavior under 
fairly comparable conditions reported 
by Sheffield (26). Here the warning 
signal was a two-second tone (appar- 
ently of fixed duration), with the shock 
coming in the last tenth of a second. 
Guinea pigs were used as subjects. If 
the animal turned a rotating cage or ac- 
tivity wheel by 1 in. before the shock 
was due, the shock was omitted. “With 
successive omissions of the shock,”
Sheffield reports, “the amplitude of the
conditioned response tended to decrease
and latency to increase, until the ani- 
mal failed to turn the wheel the re- 
quired inch in the required time, and 
another shock was received. As training
continued, more and more successive
conditioned responses occurred without 
requiring [shock], but extinction be- 
tween [shocks] continued throughout 
the training” (26, p. 171). The ampli- 
tude was restored, of course, and the 
latency cut down once more after a 
trial on which the animal failed to re- 
spond and the shock was actually ad- 
ministered. 
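
The alternation which Sheffield describes, extinction between shocks followed by recovery after each shock, can be illustrated with a toy model. The sketch below is not fitted to Sheffield's data. The single "strength" variable, the gain and loss parameters, and the response threshold are all hypothetical, chosen only to show how the two opposed operations could produce intermittent shocks.

```python
def simulate_trials(n_trials, gain=0.5, loss=0.2, threshold=0.3, strength=0.0):
    """Toy model of signalled avoidance over successive trials.

    The aversive strength of the signal rises on each shocked trial and
    falls on each trial where the signal ends without shock. All
    parameter values are illustrative assumptions, not estimates.
    Returns a list of (trial_number, shocked, strength_after_trial).
    """
    history = []
    for trial in range(1, n_trials + 1):
        responded = strength >= threshold
        if responded:
            # Signal terminated without shock: partial extinction.
            strength = strength * (1.0 - loss)
        else:
            # Failure to respond: signal is paired with shock again.
            strength = strength + gain * (1.0 - strength)
        history.append((trial, not responded, round(strength, 3)))
    return history


if __name__ == "__main__":
    for trial, shocked, value in simulate_trials(15):
        print(trial, "shock" if shocked else "avoided", value)
```

With these assumed values the model responds for several trials in succession, fails once as the strength drifts below threshold, is shocked, and recovers, so that shocks recur at intervals rather than disappearing altogether.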


AVOIDANCE TRAINING WITHOUT A 
SIGNAL 


Actually, no independent warning sig- 
nal need be presented by the experi- 
menter. This is demonstrated by some 
data recently reported by Sidman (27, 
28). In this study, one-fifth of a sec- 
ond shocks were administered to the 
rat at regular intervals of time (“shock- 
shock interval”). The animal was per- 
mitted to avoid or delay the shock, how- 
ever, by pressing a bar or lever at one 
end of the chamber. When he did so, 
the shock was postponed for another
interval (sometimes the same, sometimes
different), which was timed from the
beginning of each successive response 
(“response-shock interval”). Some fifty
animals were successfully conditioned
by this procedure. 
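
The timing rule of this procedure may be stated as a short program. The sketch below, in Python, assumes that timing begins at the start of the session, that each response resets the clock to the response-shock interval, and that the duration of the shock can be ignored. The function and parameter names are hypothetical and are not Sidman's.

```python
def sidman_shock_times(response_times, ss_interval, rs_interval, session_length):
    """Return the times at which shocks are delivered in one session.

    response_times: times (in seconds) at which the animal responds.
    ss_interval: shock-shock interval, used when no response intervenes.
    rs_interval: response-shock interval, timed from each response.
    """
    if ss_interval <= 0 or rs_interval <= 0:
        raise ValueError("intervals must be positive")
    shocks = []
    pending = sorted(response_times)
    i = 0
    next_shock = ss_interval  # assumed: the first shock is due one S-S interval in
    while True:
        # Every response made before the scheduled shock resets the clock.
        while i < len(pending) and pending[i] < next_shock:
            next_shock = pending[i] + rs_interval
            i += 1
        if next_shock > session_length:
            break
        shocks.append(next_shock)
        next_shock += ss_interval  # without a response, shocks recur regularly
    return shocks
```

With no responses the shocks simply recur at every shock-shock interval. An animal that always responds again before the response-shock interval has elapsed receives no shock at all.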

In an extension of his preliminary 
work, Sidman conducted three of his 
original animals through an extended 
series of training sessions at a variety 
of response-shock and shock-shock in- 
tervals. Regular changes in the rate 
of responding were obtained from each 
of these animals as a function of the 
length of either interval. In general, 
the more frequently the animals were 
shocked, the more frequently they re- 
sponded. The rate of responding rose 
with shorter and shorter shock-shock 
intervals and, up to a point, with shorter 
and shorter response-shock intervals. 
At relatively short response-shock inter- 
vals, however, a new phenomenon ap- 
peared: the original function gave way, 
and a “delay-of-punishment” gradient 
took over. That is, the rate began to 
decline with very short intervals be- 
tween the response and the subsequent 
shock, and the response tended to
disappear when the shock followed almost
immediately. 


AN INTERPRETATION OF AVOIDANCE 
TRAINING 


In interpreting Sidman’s data (and 
later, the operation of punishment) we 
are forced, in order to construct a gen- 
eral or inclusive description, to make 
an appeal to stimuli which are: (a) not
clearly specified; (b) not readily
observed or recorded; and (c) most
important, not under the direct control of
the experimenter. We might, by anal- 
ogy, call the stimuli which may be pre- 
sented or withheld at the will of the 
experimenter “independent” stimuli and 
those which are produced by the sub- 
ject without the intervention of the ex- 
perimenter “dependent” or “interven- 
ing” stimuli, all in accord with the 
application of these adjectives to the 
noun “variable.” If the reader objects
to an appeal to such stimuli, there are
two ways in which he may place them, 
in a sense, under the control of the ex- 
perimenter. First, he may invade the 
organism by surgical or pharmacological 
techniques and presumably segregate or 
insulate the subject, in a manner of 
speaking, from the consequences of his 
own behavior. Or, as an alternative, 
he may add or subtract presumably 
equivalent stimuli to or from his ex- 
perimental operations, making them 
likewise contingent upon the subject’s 
response. Then we can see what effect 
these substitute dependent stimuli have 
on the over-all pattern of behavior. 
This procedure might well be used to 
substantiate Sidman’s interpretation of 
his results. 

Sidman’s interpretation may be para- 
phrased as follows. Any form of be- 
havior other than pressing the bar will 
eventually be followed by shock. The 
dependent stimuli that accompany such 
behavior thereby acquire an aversive 
character, through their pairing with 
the primary stimulus. But pressing the 
bar is never immediately followed by 
shock if a reasonably long response- 
shock interval is used, and the stimu- 
lation which accompanies this form of 
response does not become aversive. 
Hence, whenever a bar press follows 
some response that has previously been 
shocked, it will be reinforced by the 
change from an aversive to a nonaver- 
sive pattern of stimulation. At first 
only a few of the possible forms of re- 
sponse may have been shocked, while 
other responses have not been so paired. 
At this stage, the bar pressing is not 
always reinforced, and the rate of press- 
ing fluctuates. As the training contin- 
ues, however, more and more of the ani- 
mal’s behavioral repertoire is shocked, 
with the result that the avoiding re- 
sponse is fairly regularly reinforced. A 
ceiling is finally imposed by the very 
success of the avoiding response itself,
as in other studies, which reduces the
over-all frequency of shock and thereby 
limits the frequency of pairing and the 
effectiveness of a given change in stimu- 
lation. 

We now see that the warning signal, 
which is usually provided in a study of 
avoidance training, plays a relatively 
subtle role. There are three possible 
relationships which tend to be con- 
founded and which are very difficult to 
segregate in a given study: 


1. As in Sidman’s procedure, the 
stimuli produced by S himself in mak- 
ing responses other than that prescribed 
by E are to some extent correlated with 
the shock and presumably provide some 
basis for the reinforcement of the avoid- 
ing reaction. This may be one of the 
sources for the responding that occurs 
in the absence of the signal, between 
successive trials (e.g., 19, 21). But
these stimuli are in actuality paired
with the shock only when they are
accompanied by the signal. Thus, the
“true” or most effective secondary stim- 
uli are a set of stimulus combinations 
or compounds (24), each including the 
warning signal as one of its elements. 
The effectiveness of these compounds 
presumably depends on the exact tem- 
poral relation between the signal and 
the shock (e.g., 16, pp. 280 ff.; 37). 

2. Furthermore, the effectiveness of 
the training procedure seems to depend 
also on whether both elements of these 
compounds are simultaneously termi- 
nated by the avoiding response; if the 
signal element is terminated before or 
after the response, the change in stimu- 
lation produced by the response is some- 
what smaller and less discriminable, and 
the reinforcement is less effective (16, 
pp. 84 ff.; 19, 21). In avoidance
training, as such, the relation between the
signal termination and the response is 
necessarily confounded with the
temporal relation between the signal and
the shock; but it may be separated in
a study of secondary aversive stimula- 
tion, where the operations of stimulus 
pairing and reinforcement have been 
segregated. 

3. Finally, the warning signal may 
“set the occasion” for the (maximal) 
reinforcement of the avoiding response. 
That is, it seems to act like a cue (8) 
or discriminative stimulus (14, 29, 30). 
Early in training the rate of responding 
may be about the same in the presence 
of the signal or in its absence (between 
“trials”); but as the training continues, 
more responses occur in the presence of 
the signal and fewer occur in its absence, 
provided that an opportunity is given 
for the animal to make these
nonreinforced responses (3), i.e., for
extinction of responses in the absence of the
signal.

Mowrer and Lamoreaux have dis- 
cussed this problem in somewhat similar 
terms, and conclude that it is the ne- 
cessity for forming a discrimination of 
some type between the presence and 
absence of the signal which slows down 
the acquisition of avoiding responses 
under the customary procedures (21). 


A COMPARISON OF PROCEDURES 


Now that we have seen how the sub- 
ject learns to prevent the arrival of the 
shock in a study of avoidance training, 
we are in a position to apply these prin- 
ciples to the inhibition or suppression 
of the response which is observed in 
most studies of punishment. Actually, 
the two procedures are so similar that 
it is difficult to find a justification for 
any major distinction in theoretical 
treatment. As Mowrer says, such a 
distinction is “far from parsimonious, 
not to say an outright contradiction” 
(17, p. 421). What distinctions there 
are, of course, arise from the fact that 
in avoidance training the experimenter 
selects the response that shall be required
to avoid the shock; whereas in
an experiment on punishment he speci- 
fies and records the response that pro- 
duces the shock. 

This does, however, lead to certain 
consequences which may be worth in- 
specting. First, in avoidance training 
the class of responses which are not
followed by shock is extremely narrow; 
it includes but one form—that speci- 
fied by the experimenter as the avoid- 
ing response. But in a free responding 
situation the class of responses which 
will eventually be paired with the shock 
is extremely broad, including anything 
else the animal might do. The breadth 
of definition for these two classes of 
response is also reflected in the initial 
frequencies of behavior: before condi- 
tioning, the frequency of the avoiding 
response is likely to be relatively low, 
to constitute a small part, quantitatively 
as well as qualitatively, of the animal’s 
activity; the combined frequency of 
other forms of behavior will be rela- 
tively high. 

The situation is reversed in a study 
of punishment. Here, it is the class of 
responses which are followed by shock, 
for example, which is limited to a single 
behavioral sequence or chain. And in 
the usual experimental situation the ini- 
tial frequency of this sequence is very 
low, so low, in fact, that it is ordinarily 
necessary to provide some form of re- 
inforcement to boost its rate to a level 
where the inhibition may readily be 
observed. But the class of responses 
which are not followed by shock is rela- 
tively broad, including any form of re- 
sponse which conflicts with members of 
the punished sequence; and these re- 
sponses are already quite plentiful at 
the beginning of training. 
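
The reversal in the breadth of the two classes can be made concrete with a small sketch. The repertoire labels below are invented for illustration, and the treatment of every nonpunished response as belonging to the unshocked class is a simplification of the account given above.

```python
# Hypothetical repertoire; the labels are invented for illustration.
REPERTOIRE = frozenset({"press bar", "groom", "rear", "sniff", "turn away"})


def response_classes(procedure, target):
    """Split the repertoire into (shocked, not_shocked) classes.

    procedure: "avoidance" or "punishment".
    target: the single response specified by the experimenter.
    """
    if target not in REPERTOIRE:
        raise ValueError("target must be a member of the repertoire")
    others = REPERTOIRE - {target}
    if procedure == "avoidance":
        # Only the specified response escapes eventual pairing with shock.
        return others, frozenset({target})
    if procedure == "punishment":
        # Only the specified response produces the shock.
        return frozenset({target}), others
    raise ValueError("procedure must be 'avoidance' or 'punishment'")
```

Calling the function with the same target under the two procedures returns the same two sets in opposite roles, which is the sense in which the situation is said to be reversed.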

Second, the experimenter has chosen 
different criteria for the administration 
of the shock in the two cases, and this 
alters the detailed response-shock
contingencies both for the avoiding
responses and for the punished responses.
In avoidance proper, he delivers the 
shock at regular intervals whenever the 
animal fails to make the required re- 
sponse. He does not specify the exact 
relationship between other forms of re- 
sponse and the arrival of the punish- 
ment. A given alternative, therefore, 
need not immediately or invariably be 
accompanied by shock unless this is 
continuous, as in simple escape train- 
ing. Thus, a certain amount of time 
will be required before each of these 
responses has effectively been paired 
with the punishment. 

But in a study of punishment itself, 
the shock is directly contingent upon 
making a particular response. The 
pairing can be made immediate and
invariable, unless the experimenter
himself wills it otherwise. Special
schedules, such as delayed or intermittent
punishment, may readily be imposed. 

Similarly, in a study of avoidance 
training the experimenter not only de- 
cides that a certain form of response 
shall lead to avoidance of the shock, 
he also determines how long the shock 
will be postponed following this re- 
sponse, if it is not repeated—as illus- 
trated by Sidman’s “response-shock in- 
terval” or by the customary interval 
between trials in a signal study. 

Not so with punishment: Here, the 
relationship between the avoiding re- 
sponses and the shock is less direct, 
for it depends on what these do to the 
original sequence of behavior. The con- 
sequent variation in the delay of pun- 
ishment has a selective effect on various 
forms and durations of avoiding re- 
sponse (27, 28, 36, 37). The way is 
opened for a “shaping up” or differen- 
tiation of the original avoiding behavior. 

Finally, the observations are different. 
In an avoidance study the experimenter 
has defined, by his criterion for
administering or withholding the shock, the
form of the avoiding response. This 
turns out to be the only response which 
can readily be recorded, since the alter- 
native forms have not been defined and 
a specified alternative may not be in- 
clusive. In a punishment study, on the 
other hand, it is the punished response 
which has been defined, and the avoid- 
ing responses cannot readily be re- 
corded. In either experiment some- 
thing might be gained by recording one 
response as a representative of the class 
of behavior which is not ordinarily ob- 
served, although the frequency of any 
single response is likely to be too low, 
unless experimentally reinforced, to pro- 
vide a very sensitive index to the re- 
mainder of the animal’s behavior. 


DISCRIMINATION OF THE AVOIDING 
REACTIONS 


There is one special problem in ap- 
plying avoidance theory to the action 
of punishment which has not been ex- 
plicitly discussed by previous writers. 
The behavior which is punished consti- 
tutes only a small fraction, qualitatively 
speaking, of the animal’s total reper- 
toire. In a laboratory study, to be sure, 
this behavior may have been strength- 
ened to such an extent by direct experi- 
mental reinforcement that it constitutes 
a relatively large proportion, quantita- 
tively speaking, of the animal’s activity. 
In this case, elements of the punished 
sequence will intrude at frequent inter- 
vals between the avoiding responses. 
Most of the animal’s behavior should 
remain “relevant” to the punished se- 
quence. If it is the pressing of a bar 
which is punished, for example, he 
should spend the bulk of his time in 
the vicinity of this bar. He will not 
“get very far,” to judge from analogous 
data (15), from the punished act. Since 
under these circumstances the animal is
almost always in danger of being
punished, no special timing or
discrimination of his avoiding responses may be
required. 

But in this respect the laboratory 
does not necessarily mirror life. Out- 
side of the laboratory a given sequence 
may not have such a degree of strength. 
The animal may spend much of his time 
in activity which is essentially “irrele- 
vant” to the punished response, i.e., 
which shows no major change in fre- 
quency following the institution of pun- 
ishment. The punished responses in- 
trude only occasionally among a variety 
of other forms of behavior. If we as- 
sume that there is some over-all limit 
to the frequency of the avoiding re- 
sponses, a discrimination seems to be 
necessary. For if discrete avoiding re- 
sponses were interspersed at random, 
regardless of what the animal might be 
doing, they would not conflict tem- 
porally with the appearance of the re- 
sponse which is punished and should 
have relatively little effect on its fre- 
quency. 

In order, then, to inhibit or suppress 
the punished response, the avoiding re- 
sponses must in a sense anticipate or 
forestall it by arising at just the mo- 
ment when this response itself would 
otherwise appear. They must, we might 
say, be correlated with its expected oc- 
currence. No “expectancy” construct, 
however, is required. The problem is 
much the same as the problem of ac- 
counting for the proper timing of avoid- 
ing responses to a signal, so that they 
may appear just prior to the primary 
stimulus. And the answer to this prob- 
lem, too, is quite analogous. 


CHAINING 


Neither our own everyday behavior 
nor the activity of one of our subjects 
in the laboratory is made up, in atomistic
fashion, of a random series of discrete
and unrelated acts. Experimentally
reinforced behavior, in particular,
flows along in fairly orderly and regular 
sequences or “chains” (14, 30), as may 
be established by the most casual ob- 
servation. Most of our laboratory rec- 
ords, it is true, depend on timing or 
tallying a single response, such as
pressing a bar, turning to the right, or
entering a goal box. We do not and cannot
record and quantify everything that the
animal does. This should not lead us,
however, to ignore, where relevant, the
fact that the behavior which we are 
studying in the “modified Skinner box,” 
the T maze, or the runway actually con- 
sists of a continuous flow of activity 
from which we have rather arbitrarily 
abstracted a single, readily recorded
element.

Again we are forced to consider stim- 
uli which are not directly under the 
control of the experimenter, for each of 
the actions in a behavioral sequence has 
some effect on the current stimulation. 
An action may enlarge, contract, add, 
subtract, or otherwise alter some set of 
visual stimuli as the animal turns his 
head or moves about; it may bring him 
into physical contact with some object 
in his environment, such as a bar, a 
pellet, a barrier, or a wall; it may pro- 
duce apparatus noises or bring new 
odors; or, as a minimum, it will nor- 
mally produce a certain amount of 
proprioceptive stimulation. Although 
these stimuli arise in the chain as a 
natural consequence of the animal’s own 
behavior, without any special interven- 
tion by the experimenter, we can largely 
duplicate their relationships to a par- 
ticular response or their own interrela- 
tionships by direct experimental manip- 
ulation. Work of this sort has been 
conducted largely under the headings of 
discrimination training and secondary 
reinforcement. 


DISCRIMINATIVE AND REINFORCING 
STIMULI 


A given chain is completed and re- 
inforced only when the necessary mem- 
bers occur in the proper sequence or 
order. It will not do, for example, for 
the animal to go through the motions 
of pressing a bar when it is at the op- 
posite end of the cage, or to chew be- 
fore the pellet is in the mouth. The 
function of signalling, so to speak, when 
to make a given response, or of “setting 
the occasion for” this response, is per- 
formed by the stimulus elements in the 
chain. There is a three-term
relationship here: discriminative
stimulus–response–reinforcement. It is only in
the presence of the discriminative
stimulus, as Skinner has called it (30, 31),
or S^D, that the next response in the
chain is appropriate and actually leads
to a reinforcing state of affairs. This 
relationship is probably well known to 
most of my readers and need not be 
labored here. Empirical demonstrations 
are numerous. They show that when 
the reinforcement of a given response— 
e.g., pressing a bar—is made to de- 
pend on the prior presence of a certain 
stimulus, be it wholly arbitrary, the ani- 
mal comes to make the response quite 
promptly (4, 29, 30), or with increased 
frequency (5), when the stimulus ap- 
pears, but fails to respond with any 
great frequency when this stimulus is 
absent. 

One of the stimulus functions, then, 
which is crucial to the formation of a 
chain, is the acquisition by this stimu- 
lus of discriminative properties. In 
addition it would appear that such stim- 
uli also acquire reinforcing properties, 
along with their discriminative role, so 
that they also serve to maintain the 
strength of the response which produces 
them (e.g., 10, 30). Although the ac- 
quisition and loss of this property seem 
to be governed by the same factors
which govern the acquisition and loss
of a discriminative property (4, 7, 22, 
39), we customarily refer to these stim- 
uli—while exercising their reinforcing 
function—as secondary reinforcers. 
Let us now consider what happens 
when an aversive stimulus like shock is 
applied as a punishment following some 
particular member of the chain. Again 
we have a three-term relationship:
discriminative stimulus–response–aversive
stimulation. The punished
response follows upon its appropriate
stimulus; the punishment itself follows 
upon the response; thus, through the 
mediation of the animal’s own behav- 
ior, aversive stimulation is paired or 
correlated rather specifically with the 
discriminative stimulus for the punished 
response. If, furthermore, the entire 
chain is run off fairly regularly and 
fairly swiftly, the aversive stimulation 
may also follow rather closely upon 
some of the stimuli which appear earlier 
in the sequence. And finally, it is
closely associated with whatever
stimulation may arise during the execution
of the punished act itself (12, 16, p.
262; 17). This reduces to the same
analysis if we break down what we had 
hitherto regarded as a single act into 
a more detailed sequence or chain in 
its own right. 

These stimuli, then, play a role which 
is similar to that of the “warning sig- 
nal” in the conventional study of avoid- 
ance training. Patterns of stimulation 
which include these elements are more 
closely and more frequently paired with 
the shock, and should be more effective 
as aversive compounds; responses which 
terminate these elements should be max- 
imally reinforced; and by “setting the 
occasion” for maximal reinforcement, 
these stimulus elements should serve as 
cues or discriminative stimuli for the 
avoiding responses. (We do, in fact, 
find that arbitrary stimuli which
indicate the punishment or nonpunishment
of a given response do affect its
frequency [6].) In a sense, then, these
are not only discriminative and
reinforcing stimuli for members of the chain
but discriminative and reinforcing (i.e.,
by their termination) stimuli for a
corresponding set of avoiding reactions.
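
The way in which the animal's own behavior spaces the stimuli of the chain with respect to the shock can be sketched as follows. The chain, its labels, and its durations are hypothetical. The sketch assumes that each stimulus appears at the start of the response it occasions and that the shock arrives as soon as the punished response is completed.

```python
def delays_to_shock(chain, punished_index):
    """Return (stimulus, seconds from stimulus onset to shock) pairs.

    chain: list of (stimulus, response, response_duration_seconds),
    in the order in which the members occur.
    punished_index: position in the chain of the punished response.
    Only members up to and including the punished one are listed.
    """
    if not 0 <= punished_index < len(chain):
        raise IndexError("punished_index is outside the chain")
    delays = []
    elapsed = 0.0
    # Work backward from the shock, accumulating response durations.
    for stimulus, _response, duration in reversed(chain[:punished_index + 1]):
        elapsed += duration
        delays.append((stimulus, elapsed))
    delays.reverse()
    return delays


if __name__ == "__main__":
    # Hypothetical bar-pressing chain with invented durations.
    example = [
        ("sight of bar", "approach", 2.0),
        ("touch of bar", "press", 0.5),
        ("click", "go to tray", 1.0),
    ]
    print(delays_to_shock(example, punished_index=1))
```

The shorter the chain's run time, the more closely the earlier stimuli are followed by the shock, which is the condition under which they too are expected to become aversive.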


DIFFERENTIATION OF THE AVOIDING 
REACTIONS 


Just as the animal must learn to make 
his avoiding responses at the time when 
the punished response is about to occur, 
to make them temporally incompatible, 
so he must also learn to make his re- 
sponses of such a form that they are 
physically or topographically incompati- 
ble with the punished response (or with 
earlier members of the chain). It is 
obvious how this occurs. The pairing 
between the discriminative stimuli and 
the punishment is mediated, as I have 
said, by the animal’s own behavior on 
continuing with the chain and making 
the punished response. The avoiding 
responses are reinforced precisely be- 
cause they are incompatible with the 
original sequence; otherwise, they too 
would be followed by shock. While we 
cannot specify the exact form which 
these new responses will take, we can, 
from our knowledge of the basis of their 
reinforcement, make some tentative pre- 
dictions. 

First, the animal may halt, “freeze,” 
or hold a pose. This probably involves 
a fine-grain vacillation between incip- 
ient movements toward completing the 
chain and opposing movements which 
serve to restore the original position 
(38). There may be some tendency 
for the animal to hold these positions 
for longer and longer durations (12, 
Experiment 3), as this further delays 
the punishment. But this development 
will be limited by the strength of the 
original chain (35, 38), and the mean 
duration of these holding responses will 





44 James A 


presumably reflect most of the variables 
which influence the original rate of the 
punished response. 

Again, the animal may make re- 
sponses which are incompatible with 
the next member of the chain and serve 
as digressions from the sequence. Even 
a slight delay in the completion of 
the chain may to some extent be re- 
inforced, and a certain amount of seem- 
ingly pointless “boondoggling” may be 
expected, like the dilatory behavior of 
a small child heading for bed. The ani- 
mal may scratch himself, stand on his 
hind legs, “wash his face,” or push the 
sawdust about. If these responses do 
nothing to cancel the previous member 
of the chain, however, they may well 
be followed by immediate completion 
of the original sequence. 

A certain premium is therefore placed 
on those forms of response which 
“undo” or cancel out one of the mem- 
bers of the sequence by an opposing 
movement or a reversal of the progres- 
sion. The animal may let the bar come 
up again; he may drop or let go of some 
object; he may turn his head away from 
the visual stimuli; or he may withdraw 
bodily from the locus of the punishment. 
These responses remove the most impor- 
tant elements of the aversive pattern, 
namely, the discriminative stimuli for 
the next response in the chain. Fur- 
thermore, they “set him back” in his 
progress, so that he is forced to repeat 
one or more members of the chain to 
get back to the point where he was be- 
fore. Thus, some differentiation of the 
form of the avoiding responses would 
seem likely, on the basis of selective re- 
inforcement—variations in the temporal 
interval between the response and the 
punishment (27, 28, 36, 37). Lengthy 
sequences of incompatible responding, 
such as wandering to the opposite end 
of the cage, might be strengthened to 
some extent if the original sequence is
weak; but these too are limited by the
tendency to return to the chain. If the 
punished behavior is relatively strong, 
they may even be “crowded out” by the 
combined interference resulting from the 
original chain plus more localized avoid- 
ing responses. The over-all situation is 
reminiscent of the “equilibrium” studied 
by Miller, Brown, and Lipofsky (in 15), 
although their analysis is limited to a 
somewhat specialized situation. 


SUMMARY 


By punishing an animal for making a 
given response (that is, by applying 
aversive stimulation), we can reduce its 
frequency of occurrence. The purpose 
of this paper has been to show how we
can fit this observation into a more 
general theoretical framework without 
adding new and independent principles 
to our system. Accordingly, I have 
tried to deduce the main effects of 
punishment from the principles already 
demonstrated in studies of second- 
ary aversive stimulation and avoidance 
training. In general, my hypothesis has 
run as follows: The punished response 
is not an isolated incident, in vacuo, 
but a member of some sequence or chain 
of responses which is linked together by 
a series of discriminative, and thereby 
secondary reinforcing, stimuli. The 
stimuli which come immediately before 
the punished response are paired by 
the response itself with the ensuing pun- 
ishment. By virtue of this pairing, 
they gain an aversive property in their 
own right. Any form of behavior which 
is incompatible with some member of 
the chain and delays the completion of 
the sequence will be reinforced, and 
thereby conditioned and maintained, by 
the corresponding elimination or trans- 
formation of these conditioned or sec- 
ondary aversive stimuli. These re- 
sponses are functionally equivalent to
the responses which are investigated in
a formal study of avoidance condition- 
ing. The fruitfulness of this hypothesis 
may therefore be tested by a detailed 
comparison of the functional relations 
observed in studies of punishment and 
studies of avoidance training. 


REFERENCES 


1. Barlow, J. A. Secondary motivation through classical conditioning:
    one trial nonmotor learning in the white rat. Amer. Psychologist,
    1952, 7, 273. (Abstract)

2. Brown, J. S., & Jacobs, A. The role of fear in the motivation and
    acquisition of responses. J. exp. Psychol., 1949, 39, 747-759.

3. Coppock, H., & Mowrer, O. H. Inter-trial responses as "rehearsal":
    a study of "overt thinking" in animals. Amer. J. Psychol., 1947,
    60, 608-616.

4. Dinsmoor, J. A. A quantitative comparison of the discriminative and
    reinforcing functions of a stimulus. J. exp. Psychol., 1950, 40,
    458-472.

5. Dinsmoor, J. A. The effect of periodic reinforcement of bar-pressing
    in the presence of a discriminative stimulus. J. comp. physiol.
    Psychol., 1951, 44, 354-361.

6. Dinsmoor, J. A. A discrimination based on punishment. Quart. J. exp.
    Psychol., 1952, 4, 27-45.

7. Dinsmoor, J. A. Resistance to extinction following periodic
    reinforcement in the presence of a discriminative stimulus.
    J. comp. physiol. Psychol., 1952, 45, 31-35.

8. Dollard, J., & Miller, N. E. Personality and psychotherapy.
    New York: McGraw-Hill, 1950.

9. Estes, W. K. An experimental study of punishment. Psychol. Monogr.,
    1944, 57, No. 3 (Whole No. 263).

10. Ferster, C. B. Sustained behavior under delayed reinforcement.
    J. exp. Psychol., 1953, 45, 218-224.

11. Guthrie, E. R. The psychology of learning. (Rev. Ed.) New York:
    Harper, 1952.

12. Hefferline, R. F. An experimental study of avoidance. Genet.
    Psychol. Monogr., 1950, 42, 231-334.

13. Hilgard, E. R., & Marquis, D. G. Conditioning and learning.
    New York: Appleton-Century, 1940.


14. Keller, F. S., & Schoenfeld, W. N. Principles of psychology.
    New York: Appleton-Century-Crofts, 1950.

15. Miller, N. E. Experimental studies of conflict. In J. McV. Hunt
    (Ed.), Personality and the behavior disorders. Vol. 1. New York:
    Ronald, 1944. Pp. 431-465.

16. Mowrer, O. H. Learning theory and personality dynamics. New York:
    Ronald, 1950.

17. Mowrer, O. H. Motivation. Annu. Rev. Psychol., 1952, 3, 419-438.

18. Mowrer, O. H., & Kluckhohn, C. Dynamic theory of personality.
    In J. McV. Hunt (Ed.), Personality and the behavior disorders.
    Vol. 1. New York: Ronald, 1944. Pp. 69-135.

19. Mowrer, O. H., & Lamoreaux, R. R. Avoidance conditioning and signal
    duration—a study of secondary motivation and reward. Psychol.
    Monogr., 1942, 54, No. 5 (Whole No. 247).

20. Mowrer, O. H., & Lamoreaux, R. R. Fear as an intervening variable
    in avoidance conditioning. J. comp. Psychol., 1946, 39, 29-50.

21. Mowrer, O. H., & Lamoreaux, R. R. Conditioning and conditionality
    (discrimination). Psychol. Rev., 1951, 58, 196-212.

22. Notterman, J. M. The interrelationships among aperiodic
    reinforcement, discrimination learning, and secondary
    reinforcement. J. exp. Psychol., 1951, 41, 161-169.

23. Page, H. A., & Hall, J. F. Experimental extinction as a function of
    the prevention of a response. J. comp. physiol. Psychol., 1953,
    46, 33-34.

24. Schoenfeld, W. N. An experimental approach to anxiety, escape, and
    avoidance behavior. In P. J. Hoch & J. Zubin (Eds.), Anxiety.
    New York: Grune & Stratton, 1950. Pp. 70-99.

25. Schoenfeld, W. N., & Antonitis, J. J. A function of respondents in
    the extinction of operant responses. Conf. exp. Anal.
    Behav.—Notes, 1949, No. 17. (Mimeo.)

26. Sheffield, F. D. Avoidance training and the contiguity principle.
    J. comp. physiol. Psychol., 1948, 41, 165-177.

27. Sidman, M. Avoidance conditioning with brief shock and no
    exteroceptive warning signal. Science, 1953, 118, 157-158.

28. Sidman, M. Two temporal parameters of the maintenance of avoidance
    behavior by the white rat. J. comp. physiol. Psychol., 1953, 46,
    253-261.





29. Skinner, B. F. The rate of establishment of a discrimination.
    J. gen. Psychol., 1933, 9, 302-350.

30. Skinner, B. F. The behavior of organisms. New York:
    Appleton-Century, 1938.

31. Skinner, B. F. Science and human behavior. New York: Macmillan,
    1953.

32. Stone, G. R. The effect of negative incentives in serial learning:
    II. Incentive intensity and response variability. J. gen. Psychol.,
    1950, 42, 179-224.

33. Thorndike, E. L. The fundamentals of learning. New York: Teachers
    Coll., 1932.

34. Thorndike, E. L. The psychology of wants, interests, and attitudes.
    New York: Appleton-Century, 1935.

35. Tolcott, M. A. Conflict: a study of some interactions between
    appetite and aversion in the white rat. Genet. Psychol. Monogr.,
    1948, 38, 83-142.

36. Warden, C. J., & Diamond, S. A preliminary study of the effect of
    delayed punishment on learning in the white rat. J. genet.
    Psychol., 1931, 39, 455-461.

37. Warner, L. H. The association span of the white rat. J. genet.
    Psychol., 1932, 41, 57-90.

38. Winnick, Wilma A. A study of incipient movements in avoidance.
    Unpublished doctor's dissertation, Columbia Univer., 1950.

39. Wyckoff, L. B. The role of observing responses in discrimination
    learning. Unpublished doctor's dissertation, Indiana Univer., 1951.


(Received February 4, 1953) 





Psychological Review 
Vol. 61, No. 1, 1954 


THE MEASUREMENT OF VALUES ¹


L. L. THURSTONE 


University of North Carolina 


In this paper I shall try to summa- 
rize briefly the attempts of several in- 
vestigators to extend the concepts of 
measurement to the subjective domain. 
While this work is admittedly crude and 
exploratory, the results do look promis- 
ing so that this field should be challeng- 
ing for further study. Here we shall 
give only brief statements of the funda- 
mental ideas without details of theory 
or experimental procedure. Our pur- 
pose here is only to sketch the nature 
of this field of research. 

When we propose to measure human 
values, colleagues in the humanities may 
shudder at the very idea. When I wrote 
a paper entitled “Attitudes Can Be 
Measured,” some of my colleagues did 
shudder. They were sure that social 
attitudes contain some essence that 
could not be identified and measured. 
They were sure that, in making the at- 
tempt, we would measure only the triv- 
ial. 

Human values are essentially subjec- 
tive. They can certainly not be ade- 
quately represented by physical objects. 
Their intensities or magnitudes cannot 
be represented by physical measure- 
ment. At the very start we are faced 
with the problem of establishing a sub- 
jective metric. This is the central 
theme in modern psychophysics in its 
many applications to the measurement 
of social values, moral values, and es- 
thetic values. Exactly the same prob- 
lem reappears in the measurement of 
utility in economics. 

In order to establish a subjective met- 
ric we must have a subjective unit of
measurement. Before we can accept a
subjective metric, it must satisfy the
logical requirements of measurement as
distinguished from rank order. These
objectives have been approximated in
the equation of comparative judgment
and its variants.

¹ This paper was read at the Southern
Society of Philosophy and Psychology in
Knoxville, Tennessee, on April 11, 1952.

Before proceeding to discuss the 
many applications of the subjective met- 
ric, we shall review briefly the principal 
psychophysical concepts by which a 
subjective metric can be established. 

Let us consider these concepts in 
terms of a rather simple example, 
namely, the judgment of excellence of 
handwriting. When we look at several
specimens of handwriting, it is fairly
easy to select some that are considered
to be excellent and others that are
judged to be poor. In general, there is
good agreement in such judgments. If
we were asked to equate our judgments 
of excellence in a handwriting specimen 
to some physical measurements on the 
script, we would find it difficult. One 
of the main requirements of a truly sub- 
jective metric is that it shall be entirely 
independent of all physical measure- 
ment. In freeing ourselves completely 
from physical measurement, we are also 
free to experiment with esthetic objects 
and with many other types of stimuli 
to which there does not correspond any 
known physical measurement. 

If we present a single handwriting 
specimen to a subject with the request 
that he tell us how good he thinks it 
is, then he must try to convey the de- 
gree of excellence in terms of words. 
It is well known that people vary tre- 
mendously in their use of superlatives 
in appraisals of experience, and, con- 
sequently, it is preferable to avoid such
a direct procedure. Next we proceed
to pairs of stimuli. We can ask the sub- 
ject to judge which is the better of two 
specimens. In so doing, the subject 
gives his comparative judgment for each 
pair and he is not asked to give any 
verbal description of excellence. 

The degree of excellence of a hand- 
writing specimen is experienced by the 
subject in terms of some subjective 
process or quale. Since nothing is 
known about the neurological correlates 
of judgments of excellence of handwrit- 
ing, we shall dodge all such terminology 
by merely referring to the discriminal 
processes by which the subject does, in 
fact, discriminate between the different 
specimens. These processes may be as- 
sumed to be physical or truly subjective 
according to the preferences of the in- 
vestigator. His preference on this point
has nothing to do with the subsequent
development of the law of comparative 
judgment. 

When the subject makes a judgment
that one specimen seems to him to be
better than another specimen, we postu- 
late discriminal processes which differ 
in some manner in terms of which the 
percipient does make the discrimination. 
The more excellent specimen has some 
quale which differs from that of the 
poorer specimen. Imagine that the dis- 
criminal processes which correspond to 
different values are arranged in a spec- 
trum from those discriminal processes 
in terms of which the percipient ex- 
periences the good specimens to the 
other end of the spectrum with discrim- 
inal processes in terms of which he ex- 
periences what he calls the poorer speci- 
mens. 

Consider next the phenomena of dis- 
persion. If one subject were to examine 
the same specimen in comparative situ- 
ations on a large number of occasions, 
it is not to be expected that he would 
always experience a particular specimen 
with the same discriminal process. It
can be assumed that the same specimen
will be experienced in terms of discrim- 
inal processes in the same general region 
of the subjective continuum that has 
been postulated. So far we have no 
metric. 

At this point we recall one of the fun- 
damental restrictions on the problem of 
establishing a subjective metric. The 
discriminal processes must be assumed 
to be of such a character that they do 
not necessarily have intensities or mag- 
nitudes which can be in any sense meas- 
ured. This is an old problem that was 
discussed many years ago in psycho- 
physical theory. For theoretical con- 
siderations, imagine that the discriminal 
processes could actually be identified on 
each occasion when the subject makes 
a comparative judgment. The repeated 
observations of the same specimen can 
be assumed to produce an error varia- 
tion from one occasion to the next. If 
we consider the relative frequencies of 
these discriminal processes as responses 
to the same stimulus, then we can postu- 
late a Gaussian error distribution for the 
responses to the repeated observations 
of the same stimulus. Let us now as- 
sume that the spectrum of discriminal 
processes is stretched or contracted in 
different parts in such a way that the 
frequency distribution of these processes 
is Gaussian in terms of any given stim- 
ulus. Now we have a metric, but it 
is so far an entirely arbitrary metric. 
Imagine, at least in theory, that the 
same procedure can be repeated for 
many different stimuli which cover the 
whole range of discriminal processes in 
terms of which degree of excellence is 
experienced. It is now a question of 
experimental fact whether the metrics 
determined for the separate stimuli will 
be the same when all of the stimuli are 
considered together. It has been found 
in many experiments that such is the 
case. 

If we represent in the same model the
comparative judgment of two stimuli in
which the subject says for each presen- 
tation which of the pair is the better, 
then we can observe the proportion of 
attempts in which the subject judges 
specimen j to be better than specimen k.
If we have a whole table of such pro- 
portions, it is possible to infer the spa- 
tial separations of the different distri- 
butions of discriminal processes. Each 
stimulus is then assumed to project a 
Gaussian distribution on the subjective 
continuum with a mean and a discrim- 
inal dispersion. An ambiguous stimulus 
will project a wider dispersion on the 
subjective continuum than a sharply de- 
fined or relatively unambiguous stimu- 
lus. Each stimulus will then be defined 
in the subjective continuum by its mean 
position which is called a scale value 
and by the standard deviation of its dis- 
persion of discriminal processes. Each 
stimulus is then defined by two parame- 
ters in the subjective continuum. 
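
The model just described is usually written as the equation of comparative judgment. The sketch below uses the standard notation, which this summary does not itself spell out, so the symbols are assumptions here: $S_j$ and $S_k$ are the scale values, $\sigma_j$ and $\sigma_k$ the discriminal dispersions, $r_{jk}$ the correlation between the two discriminal processes, and $x_{jk}$ the unit normal deviate corresponding to the observed proportion $p_{j>k}$ of judgments "j better than k".

```latex
S_j - S_k = x_{jk}\,\sqrt{\sigma_j^{2} + \sigma_k^{2} - 2\,r_{jk}\,\sigma_j\,\sigma_k},
\qquad
p_{j>k} = \Phi(x_{jk})
```

If the dispersions are taken as equal and the correlations as zero (a simplifying assumption commonly called Case V), the equation reduces to $S_j - S_k = x_{jk}\,\sigma\sqrt{2}$, so that the scale separations can be read directly from the normal deviates.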


Before we can put numbers into these 
parameters, we must define an arbitrary 
origin which may be taken as the mean 
value that one of the stimuli projects on
the continuum. As a unit of meas-
urement we may choose arbitrarily the 
standard deviation of the dispersion 
which that stimulus projects on the sub- 
jective continuum. When that has been 
done, similar numerical values can be 
assigned to all of the other specimens 
that have entered into the comparative 
judgments. Further, we can test for the 
internal consistency of this theoretical 
model. 
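
The steps above (proportions, arbitrary origin, arbitrary unit, consistency check) can be illustrated with a minimal Python sketch. It assumes the simplified Case V form, with equal dispersions and uncorrelated discriminal processes, so the unit is the common dispersion times the square root of two rather than the dispersion of one chosen stimulus. The function names `case_v_scale` and `predicted_proportions` are hypothetical, and the row-mean estimate is one common least-squares solution, not necessarily the procedure used in the experiments reported here.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # unit normal, mean 0 and standard deviation 1


def case_v_scale(p, origin=0):
    """Estimate scale values from a table of paired-comparison proportions.

    p[j][k] is the proportion of judgments in which stimulus j was judged
    better than stimulus k. Proportions of exactly 0 or 1 have no finite
    normal deviate and make inv_cdf raise an error.
    """
    n = len(p)
    # Convert each proportion to a unit normal deviate x_jk (0 on the diagonal).
    x = [[0.0 if j == k else STD_NORMAL.inv_cdf(p[j][k]) for k in range(n)]
         for j in range(n)]
    # Row means give each scale value relative to the mean of all stimuli.
    s = [sum(row) / n for row in x]
    # Move the arbitrary origin to the chosen stimulus.
    return [value - s[origin] for value in s]


def predicted_proportions(s):
    """Proportions implied by scale values s under the same Case V assumptions."""
    return [[STD_NORMAL.cdf(sj - sk) for sk in s] for sj in s]


def largest_discrepancy(p, s):
    """Internal-consistency check: largest gap between observed and implied p."""
    q = predicted_proportions(s)
    n = len(s)
    return max(abs(p[j][k] - q[j][k])
               for j in range(n) for k in range(n) if j != k)
```

A small value from `largest_discrepancy` indicates that a single set of scale values reproduces the whole table of proportions, which is the sense in which the model can be tested for internal consistency.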

It should be carefully noted that we 
have not assumed that the discriminal 
processes have magnitudes of any kind. 
They have been dealt with merely as 
subjective quales and we have assumed 
only that in principle their relative fre- 
quency of association with any given 
stimulus can be ascertained. While this 
cannot be done directly, these frequen- 
cies can be inferred indirectly from the
observed comparative data. It should
also be noted that we have not postu- 
lated the existence of any physical meas- 
ures of any kind for the stimuli that 
have entered into the comparative judg- 
ments. 

With this formulation of the law of 
comparative judgment, we are free to 
proceed with comparative studies of all 
kinds of stimuli which have no physical 
measure whatever. Hence we can turn 
to a wide array of interesting psycho- 
logical problems involving value judg- 
ments. The freedom from any postu- 
lated physical measurement is the key 
that makes studies of this kind possible. 

The method of comparative judgment 
turns out to be a rather general experi- 
mental procedure, and the well-known 
constant method in psychophysics is a 
special case in which one of the stim- 
uli is arbitrarily taken as the standard 
which is compared with all of the other 
stimuli. Classical psychophysics was 
concerned with the more restricted prob- 
lem of limen determinations. 

We turn next to a brief review of 
some of the classical psychophysical 
methods because some of them have 
application in modern problems which 
transcend the determination of limens. 
In the method of equal-appearing inter- 
vals, the subject is asked to sort a large 
number of stimuli into a specified num- 
ber of successive categories, say six or 
eight or ten. He is instructed to sort 
them in such a way that the intervals 
represented by the categories seem to 
him to be equal. This method is use- 
ful for rough survey purposes, but it 
can be shown that, even when the sub- 
ject attempts to do this, he actually does 
not succeed in making the intervals sub- 
jectively equal. The method is, how- 
ever, useful for coarse scaling such as 
the construction of attitude scales. The 
old method of equal-appearing intervals 
has been modified into what we call the 
method of successive intervals, in which
the intervals are defined by descriptive
phrases or by sample specimens. This 
method has been found to be very use- 
ful in various types of surveys to be 
discussed. 

One of the old psychophysical meth- 
ods was to ask the subject to sort a 
number of specimens into rank order. 
It has been found that rank orders can 
be analyzed in such a way as to obtain 
data approximately equivalent to that 
of the method of paired comparison. 
The method of successive intervals can 
even be analyzed as a variant of the 
method of single stimuli. 
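
One simple way to see how rank orders yield paired-comparison data is to count, for every pair, how often one specimen is ranked ahead of the other. The Python sketch below is illustrative only: the function name `rankings_to_proportions` and the data layout are assumptions, and this counting rule is not necessarily the analysis the author has in mind.

```python
def rankings_to_proportions(rankings, n_items):
    """Turn rank orders into a table of paired-comparison proportions.

    rankings: list of orderings, each listing item indices from best to worst.
    Returns p, where p[j][k] is the proportion of rankings that place item j
    ahead of item k. Diagonal entries are set to 0.5 by convention.
    """
    wins = [[0] * n_items for _ in range(n_items)]
    for order in rankings:
        for position, j in enumerate(order):
            # Every item listed after j was ranked below j by this subject.
            for k in order[position + 1:]:
                wins[j][k] += 1
    total = len(rankings)
    return [[0.5 if j == k else wins[j][k] / total for k in range(n_items)]
            for j in range(n_items)]
```

The equivalence is only approximate because a rank order forces each subject's judgments to be transitive, whereas separate paired comparisons do not.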

Since Weber’s law and Fechner’s law 
have figured so prominently in the his- 
tory of psychophysics, we shall make a 
few comments about these two laws in 
relation to the modern setting. These 
two laws are frequently referred to as
the Weber-Fechner law with the impli- 
cation that they are the same law, but 
that is an error. It is possible to set
up experiments with rather simple stim-
uli in which one of these laws will be
verified when the other one is not veri-
fied. It would be useful to set up such
experiments in order to show clearly the 
separation between the two laws. Web- 
er’s law states that the proportion of 
judgments R > kR is a constant. R
signifies here the physical magnitude of
the stimulus and k represents another
constant. Weber's law is concerned 
solely with physical measurements. It 
does not explicitly refer to the subjec- 
tive continuum. On the other hand, 
Fechner’s law states frankly the relation 
between the subjective continuum and 
the physical stimulus continuum. Fech- 
ner’s law states that this relation is 
generally logarithmic, and it should be 
taken as a rough approximation to the 
relation between the subjective and the 
physical continua. Further, it can be 
seen that Fechner’s law is applicable 
only to those stimuli which have a 
physical magnitude as well as an ex-
perienced intensity. The law of com-
parative judgment is completely in- 
dependent of any physical stimulus 
magnitudes. The problem of the stim- 
ulus error is not ordinarily of serious 
concern to our problem. It deals with 
the ambiguity in the mind of the sub- 
ject when he is asked to judge a stimu- 
lus as to the intensity of the subjective 
experience. Sometimes he attempts in- 
stead to judge the physical magnitude. 
A good example is that of a grocery 
clerk who can judge the weight of a 
bag of sugar. If he were asked to serve 
as a subject in the method of mean 
gradation, he would probably commit 
what Titchener would have called the 
stimulus error. In the measurement of 
social values, we are not interested in 
physical measurements because in gen- 
eral they do not exist for such values. 
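
The distinction between the two laws drawn above can be made explicit in symbols. This is a sketch in notation that is partly assumed: $R$ and $k$ follow the text, while $S$ (position on the subjective continuum), $C$, $a$, and $b$ are labels introduced here for illustration only.

```latex
% Weber's law: a statement about physical magnitudes and judgment proportions only
\Pr\{\, R \text{ judged greater than } kR \,\} = C \quad \text{for all } R

% Fechner's law: a relation between the subjective and the physical continua
S = a \log R + b
```

The first statement contains no subjective quantity at all, while the second cannot even be written down unless the stimulus has a physical magnitude $R$, which is why one can hold in an experiment where the other fails.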

A very important advance in the ap- 
plication of psychophysical methods was 
accomplished by Richardson when he 
devised the triad method for studying 
the dimensionality of a domain. In- 
stead of asking a subject to judge 
whether one stimulus is x-er than some 
other stimulus where x is any specified 
attribute, he set up the discrimination 
experiment in such a way that no at- 
tribute was specified. In the method 
of triads, the subject would be shown 
three patches of color, for example, and 
he would be asked to indicate which is 
the odd one with the implication that 
the remaining pair is more alike than 
any other of the three pairs. In this 
way the subject can make judgments of 
the degree of similarity or difference 
without having any specified attribute. 
Data collected in this manner can be 
transformed into the equation of com- 
parative judgment and the dimensional- 
ity of the domain can then be ascer- 
tained by the Young-Householder the- 
orem. Such a method can be used ex- 
perimentally to determine the dimen-
sionality of the various sensory modali-
ties. 
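
The Young-Householder result referred to above states that, for points in a Euclidean space, the dimensionality equals the rank of the matrix of scalar products formed from the interpoint distances with one point taken as origin. The Python sketch below illustrates that rank computation on exact distances. The function names and the tolerance are assumptions made for illustration, and with fallible experimental data one would examine which latent roots are appreciably different from zero rather than compute an exact rank.

```python
import math


def distance_matrix(points):
    """Euclidean distances between all pairs of points (tuples of coordinates)."""
    return [[math.dist(a, b) for b in points] for a in points]


def scalar_product_matrix(d, origin=0):
    """B[i][j] = (d_oi^2 + d_oj^2 - d_ij^2) / 2 for all points except the origin."""
    idx = [i for i in range(len(d)) if i != origin]
    return [[(d[origin][i] ** 2 + d[origin][j] ** 2 - d[i][j] ** 2) / 2
             for j in idx] for i in idx]


def matrix_rank(m, tol=1e-9):
    """Rank by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in m]
    rows, cols = len(a), len(a[0])
    rank = 0
    for c in range(cols):
        if rank == rows:
            break
        # Choose the largest remaining entry in this column as pivot.
        pivot = max(range(rank, rows), key=lambda r: abs(a[r][c]))
        if abs(a[pivot][c]) <= tol:
            continue  # column is numerically zero below the current rank
        a[rank], a[pivot] = a[pivot], a[rank]
        for r in range(rank + 1, rows):
            factor = a[r][c] / a[rank][c]
            for cc in range(c, cols):
                a[r][cc] -= factor * a[rank][cc]
        rank += 1
    return rank


def dimensionality(d, origin=0, tol=1e-9):
    """Number of dimensions needed to reproduce the distances in d."""
    return matrix_rank(scalar_product_matrix(d, origin), tol)
```

In the triad setting, the entries of `d` would first have to be estimated from the similarity judgments; that scaling step is not shown here.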

Perhaps the best known application 
of these experimental methods for the 
study of values is in the measurement 
of social attitudes. The most sensitive 
experimental procedure is to present the 
subject with pairs about which he is 
asked to make certain judgments. For 
example, he may be presented with pairs 
of nationalities, and he may be asked 
to judge for each pair which he would 
rather associate with. That type of ex- 
periment has been carried out in sev- 
eral ways. The judgments that are 
made by the subject depend, of course, 
partly on his own preferences which are 
closely related to his own nationality, 
and the judgments are also determined 
by the nationalities that are judged. If 
two groups of subjects are asked to 
make judgments of this kind, one can 
say on the basis of objective evidence 
which of the two groups is more tol- 
erant of other nationalities. At one 
extreme we would have people who are 
completely tolerant toward all nationali- 
ties. They would then also, of course, 
be completely indifferent about their 
own. Such people would have no na- 
tional loyalty or identification. At the 
other extreme we would have people who 
are said to be strongly prejudiced or 
biased. They would have extreme loy- 
alties to some nationalities and extreme 
dislikes for others. I doubt whether 
we should consider either of these two 
extremes to be ideal. 

Some years ago Fred Eggan wrote a 
master’s thesis in psychology before he 
went into the field of anthropology. In 
that master’s thesis he wanted to know 
the effect of different forms of question 
with reference to nationalities. He had 
five different questions representing dif- 
ferent degrees of intimacy. All five 
groups of subjects were given the same 
lists of pairs of nationalities, but there 
were different questions. One group
had the question, Which of each pair
of nationalities would you rather asso- 
ciate with? Another group had the 
same nationality lists, but they were 
given the question, Which would you 
rather have as a fellow student? An- 
other group had the question, Which 
nationality would you rather have your 
sister marry? The proportions were 
superficially quite different, but the rank 
orders of the nationalities were essen- 
tially the same. In this case, we would 
probably find that the form of the ques- 
tion has a tremendous effect on the dis- 
criminal dispersion but relatively little 
effect on the order of the nationalities. 
The effectiveness of comparative judg- 
ment for studies of this type should be 
exploited further.

In studying the measurement of so- 
cial attitudes, the attempt is sometimes 
made to validate such experiments in 
terms of overt behavior, but that is an 
error. Samuel A. Stouffer of Harvard 
wrote a doctor’s dissertation some years 
ago at the University of Chicago on this 
problem. He investigated social atti- 
tudes by means of statement scales in 
reference to the prohibition issue. He 
obtained data about his subjects as to 
their actual behavior on prohibition. 
He found that there was pretty fair 
agreement between what the subjects 
said on the attitude scales and how they 
actually behaved. I should like to point 
out that, while such a comparison is of 
considerable interest, it is not a valida- 
tion of the attitude scale. A man may 
be entirely consistent in what he says 
and in what he does about a contro- 
versial issue, and yet both of these in- 
dices may be dead wrong in reflecting 
his attitude. In order to determine a 
man’s attitudes in the sense of affective 
disposition about a controversial issue, 
it will be necessary for his friends to 
ask him privately when he is free to 
speak his mind and when he is not 
likely to be quoted. His personal atti- 





L. L. THuURSTONE 


tudes may or may not agree with what 
he says and what he does. Here again, 
attitudes are essentially subjective ex- 
periences which may or may not con- 
form with overt action. 

Another distinction in the study of 
social attitudes which is sometimes lost 
sight of is that the cognitive and the af- 
fective appraisals may be entirely inde- 
pendent. For example, a group of sub- 
jects may agree in their strong dislike 
of communism. Someone might give 
them an examination in order to show 
that the subjects actually do not know 
what they are talking about. That 
might very well be true, but the psy- 
chological fact is nevertheless inescap- 
able that the affective attitudes may be 
strongly for or against a stimulus even 
if there is a great deal of confusion 
about its cognitive description. 

The statement scale is not so sensitive 
as the paired-comparison procedure. It 
consists in a set of statements to which
the subject responds by acceptance or
rejection of each statement. In con-
structing such a scale, one presents a 
large number of statements to a group 
of subjects whose principal qualifica- 
tion is that they can read English. 
These subjects are asked to indicate for 
pairs of statements which represents the 
stronger attitude for or against x, where 
x represents the psychological object to 
which the attitude scale refers. For 
rough survey purposes, the attitude 
scales are useful. 

An interesting application of these 
methods of studying values is to ap- 
praise the effects of propaganda. We 
made a large number of experiments on 
the effects of motion-picture films on 
the social attitudes of high school chil- 
dren. Statement scales and paired-com- 
parison schedules of various kinds were 
given before and after the showing of 
a motion picture. By this method we 
were able to ascertain whether a given 
picture had a significant effect and in
what direction it did affect the chil-
dren’s social attitudes. 

The method has also been applied in 
the study of international tensions by 
noting newspaper editorials. In one of 
those investigations a study was made 
with Chinese and Japanese newspaper 
editorials concerning each other, and it 
was shown, by treating key statements 
from the newspaper editorials, that the 
tensions increased at a very great rate 
before the two countries were at war. 
Quincy Wright has suggested in his po- 
litical science studies that such applica- 
tions of psychophysical methods might 
be useful in studying international ten- 
sions before they become very marked. 

An application of these subjective 
measurement methods which has not yet 
been made will be in the definition of 
the morale of a group. In general, the 
morale of a group is described by news- 
paper reporters and by others who mix 
their own value judgments with the 
characteristics of the group to be de- 
scribed. For scientific work we should 
have a definition of morale which is en- 
tirely independent of the value judg- 
ments of the observer. Such a defini- 
tion could be stated in terms of the 
dispersions of all of the debatable issues 
within the group. Other applications 
would be in the comparison of cultural 
and nationality differences as to the 
values that are considered to be essen- 
tial. It is unfortunate that most stu- 
dents of social psychology and political 
science are too descriptively minded to 
adapt the quantitative methods that 
may be available. 

Let us turn next to the experimental 
study of moral values. We have car- 
ried out several experiments in which a 
group of subjects was given a list of 
offenses that were presented in pairs. 
For each pair the subjects were asked 
to indicate which of the pair they con- 
sidered to be the more serious. On the 
basis of data of this kind and with the
aid of the equation of comparative judg-
ment, we ascertain the scale values and 
dispersions for these offenses. In one 
case we gave a group of high school 
students such a list of offenses and 
we determined the scale values and 
dispersions for these stimuli for three 
occasions. The first presentation was 
a day or two before they saw a film 
that described the life of a gambler. 
A few days after seeing the film they 
were given the second similar schedule. 
About six months later they were given 
the third schedule. The film described 
the life of a gambler and we wanted 
to know whether this film had an ap- 
preciable effect on the attitudes of the 
high school youngsters toward gambling. 
We found that they considered gambling 
to be a much more serious offense after 
seeing this film than they did before 
seeing the film. In a number of ex- 
periments of this type, we also found 
that the motion pictures had much more
lasting effects than is ordinarily sup-
posed. In many cases we found that
only half of the effect of the film wore 
off in six months. It should be said, 
however, that these experiments were 
carried out in small towns in Illinois
where the children do not see so many 
movies as in the large cities. We car- 
ried out a similar experiment in the 
Hyde Park High School in Chicago 
where the children were given free 
tickets to a movie at the Tower The- 
ater, a few blocks away. There we 
found that the effect was very slight. 
Our interpretation was that one movie 
more or less for children in a large 
city high school makes very little dif- 
ference in their attitudes. These meth- 
ods of studying moral values could be 
used very effectively in the comparison 
of different groups in a large city. The 
groups might represent different nation- 
ality backgrounds and different religious 
backgrounds. It would be interesting 
to ascertain what these differences would
be. Such social psychological studies
would help us to understand the prob- 
lems of the extremely heterogeneous 
populations in the large cities. In a 
similar manner we have investigated ex- 
perimentally the summation effect in 
propaganda where a single 
stimulus does not show a statistically 
significant effect. 

Another interesting field of applica- 
tion is in experimental semantics. It 
would be useful, for example, to have 
an index of affective intensity for ad- 
jectives in a dictionary. Two adjectives 
may be equivalent as to cognitive mean- 
ing and yet differ widely in affective 
meaning. The words famous and no- 
torious might be examples. So are 
the words pleasant, gay, and hilarious. 
Such affective indices would be useful 
in translating a foreign language. 

We turn now to another type of psy- 
chophysical problem. In the psycho- 
physical methods that we have consid- 
ered so far the main problem was to 
allocate each idea or object to a sub- 
jective continuum which may be uni- 
dimensional or multidimensional de- 
pending on the nature of the problem. 
In most problems it is unidimensional. 
For example, if we ask subjects to judge 
the relative seriousness of offenses, we 
are dealing frankly with a unidimen- 
sional continuum, even though the dis- 
criminations may take place in a mul- 
tidimensional continuum. We have 
here an obverse psychophysical prob- 
lem. Having determined the subjec- 
tive space which describes a group of 
subjects as to their attitudes in some 
field, we now inquire whether we can 
predict in any way what these people 
will do. When we turn the psychophys- 
ical problem in this manner, we find 
some exceedingly interesting psycho- 
physical theorems of a new kind. I 
shall give a few examples. 

Consider two political candidates for 
an election. Let one of them have a 







wide dispersion on the affective con- 
tinuum. By this we mean that some 
people are very enthusiastic about this 
candidate, whereas others actually hate 
him. Let the other candidate have the 
same average popularity, but assume 
that he has a narrow dispersion so that 
very few people are enthusiastic about 
him and very few people strongly dis- 
like him. If these two candidates come 
to an election, we should expect them 
to split the vote evenly. However, the 
more variable of these two candidates 
might introduce a third candidate of 
approximately equal popularity and who 
also has a narrow dispersion. Then we 
would have three candidates, one with 
wide dispersion on the affective con- 
tinuum, and two candidates of narrow 
dispersion, and all three of them would 
be equally popular on the average. In 
such a situation, the more variable of 
the candidates would draw half the 
votes and the other two candidates 
would get twenty-five per cent each. 
These proportions would be altered 
somewhat depending on intercorrela- 
tions between the attitudes toward the 
candidates, but the principle can be 
illustrated in the general case for zero 
correlation. This principle is no doubt 
well known among politicians, but I 
doubt whether any of them have ever 
thought of this principle as a psycho- 
physical theorem. 
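
The election example can be checked with a small simulation. The following is only a sketch under stated assumptions: affective values are taken to be normally distributed and uncorrelated, every candidate has the same mean, each voter votes for the candidate he values most, and the dispersions (20 and 1) are invented for illustration. The function name `vote_shares` is hypothetical.

```python
import random

def vote_shares(dispersions, n_voters=50_000, seed=0):
    """Estimate each candidate's share of first-place votes.

    Each voter draws an independent normal affective value for every
    candidate (mean 0, so all candidates are equally popular on the
    average) and votes for the candidate with the highest value.
    """
    rng = random.Random(seed)
    wins = [0] * len(dispersions)
    for _ in range(n_voters):
        values = [rng.gauss(0.0, sd) for sd in dispersions]
        wins[values.index(max(values))] += 1
    return [w / n_voters for w in wins]

# Assumed dispersions: one very variable candidate, the others narrow.
two_way = vote_shares([20.0, 1.0])
three_way = vote_shares([20.0, 1.0, 1.0])
print(two_way, three_way)
```

Under these assumptions the two-candidate contest splits about evenly, while in the three-candidate contest the variable candidate keeps roughly half the votes and the two narrow candidates divide the remainder. The split approaches one half and one quarter each only as the wide dispersion becomes large relative to the narrow ones.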

Let us turn to another simple exam- 
ple from the field of market research. 
Consider a mail-order house or a retail 
store which carries a limited number of 
neckties. They desire to please the 
majority of their clientele. The manu- 
facturers offer many hundreds or thou- 
sands of necktie patterns. If you turn 
to market research people with this 
problem, they may ascertain the 20 or 
30 or perhaps 50 of the most popular 
designs, and they may suggest that 
these be the designs that should be car- 
ried. But that is the wrong answer. 


Suppose that several hundred necktie 
patterns were submitted to a sample 
of the clientele. With such records one 
could rather easily determine not only 
which patterns should be carried, but 
also the number of patterns that should 
be carried in order to satisfy a specified 
proportion of the clientele. We would 
start with the most popular design and 
set that aside to be included. In the 
sample population we would then elim- 
inate all who chose that popular pat- 
tern. Then we would inquire about 
the most popular pattern in the remain- 
der of the sample population. That 
pattern would be set aside as the sec- 
ond design to be accepted. Eliminating 
those who chose that pattern, we would 
ascertain the most popular pattern in 
the remainder of the sample population. 
Proceeding in this way, we would come 
to the point where an additional pat- 
tern would increase the selection by 
only a very small percentage of the 
population and that would be the time 
to stop. In such a procedure we could 
determine the number of patterns as 
well as the designs which should be 
used in order to satisfy a specified pro- 
portion of the clientele. The ordinary 
solution of selecting the most popular 
designs would lead to a situation where 
some customers are confused by hav- 
ing many patterns which are equally 
acceptable while other customers find 
nothing to please them. The maxi- 
mum satisfaction will be derived by 
proceeding in some such way as I have 
outlined. There is nothing profound 
about this procedure, and yet it would 
probably be novel in market research. 
There are situations where problems of 
this sort can be of national importance. 
If it should be necessary to restrict the 
manufacture of civilian goods, then it 
might be important to encourage the 
manufacture of a limited number of de- 
signs for all sorts of things and to se- 
lect those designs in such a manner as 







to please the majority of the civilian 
population. In this manner the psycho- 
physical methods may be important in 
contributing toward national morale. 
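
The necktie procedure described above can be sketched in a few lines. This is an illustration, not a finished survey tool: the data layout (one set of chosen patterns per respondent), the pattern names, and the stopping threshold `min_gain` are all assumptions introduced here.

```python
def select_patterns(acceptable, min_gain=0.02):
    """Successive selection of patterns from survey records.

    acceptable: one set per respondent, holding the patterns that
                respondent chose (hypothetical data layout).
    min_gain:   stop when the best remaining pattern would add less than
                this fraction of the whole sample (assumed stopping rule).
    Returns the chosen patterns in order and the cumulative coverage.
    """
    total = len(acceptable)
    remaining = list(acceptable)
    chosen, coverage = [], []
    covered = 0
    while remaining:
        # Count the popularity of each pattern among those not yet satisfied.
        counts = {}
        for prefs in remaining:
            for pattern in prefs:
                counts[pattern] = counts.get(pattern, 0) + 1
        if not counts:
            break  # the rest of the sample chose nothing at all
        best = max(sorted(counts), key=counts.get)  # ties broken alphabetically
        if counts[best] / total < min_gain:
            break  # an additional pattern adds too little: time to stop
        chosen.append(best)
        covered += counts[best]
        coverage.append(covered / total)
        # Eliminate everyone who chose the pattern just set aside.
        remaining = [prefs for prefs in remaining if best not in prefs]
    return chosen, coverage

# Invented sample of ten respondents.
sample = [
    {"stripe", "dot"}, {"stripe", "dot"}, {"stripe"}, {"stripe", "dot"},
    {"paisley"}, {"paisley"}, {"plaid"}, {"dot", "stripe"},
    {"paisley", "plaid"}, {"stripe"},
]
chosen, coverage = select_patterns(sample, min_gain=0.15)
print(chosen, coverage)
```

In this invented sample the two patterns with the highest raw popularity are stripe and dot, yet together they please only six of the ten respondents, because everyone who chose dot also chose stripe. The successive procedure picks stripe and then paisley and pleases nine of the ten.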
Recently we made an experiment on 
the prediction of choice with regard to 
menus. In this problem we were con- 
cerned with the simplification of psycho- 
physical methods to the point where 
they would be practicable for survey 
purposes. The psychophysical methods 
of the laboratory are often too laborious 
to be used in practical surveys. It was 
decided to adapt the method of suc- 
cessive intervals for this problem. We 
presented a list of 40 foods on a suc- 
cessive interval schedule in which each 
subject was asked to indicate by a 
single checkmark his relative degree 
of like or dislike for each food item. 
There were nine short descriptive 
phrases which represented degrees of 
like and dislike for foods. This sched- 
ule of 40 items required less than five 
minutes for each of several hundred 
adult men subjects. In addition to this 
short survey schedule, we also pre- 
sented them with 16 menus in which 
they were asked to indicate what they 
would be likely to choose from each 
menu. For example, there were four 
lists of desserts, several lists of entrees, 
other lists of vegetables, and the like. 
For each menu the subjects were asked 
merely to check which they would se- 
lect from a given list. Vanilla ice 
cream occurred in several of the des- 
sert menus. The proportion of the 
subjects who select vanilla ice cream 
for dessert depends, of course, in part 
on their relative like or dislike for this 
dessert, but the selections would also 
depend on the competing items in the 
dessert list. By the application of the 
method of successive intervals and some 
theorems in psychophysics, we predicted 
the proportion of the subjects who 
would select each one of the items and 
there were 56 such predictions. These 
predictions were based entirely on the 
short, five-minute schedule for the 
whole list of 40 foods. We compared 
these predictions with the actual choices 
that the subjects made when they were 
confronted with the actual menus. The 
agreement was remarkable. The maxi- 
mum discrepancy was between 3 and 4 
per cent with one conspicuous excep- 
tion for a dichotomy, namely, roast beef 
and fried chicken. The ratings for these 
two items were both in the upper two 
categories and the discrepancy was there 
8 per cent, which was probably due to 
the effect of coarse grouping. The ex- 
periment demonstrated quite adequately 
that the prediction of choice can be ef- 
fectively made with very simple survey 
schedules if these schedules are properly 
analyzed. 
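
For a dichotomy such as the one mentioned above, one common way to turn scale values and dispersions into a predicted proportion can be sketched as follows. This is not necessarily the exact theorem used in the experiment: it assumes normally distributed, uncorrelated affective values, and the scale values and dispersions shown are invented. The function name `prob_prefer` is hypothetical.

```python
import math

def prob_prefer(mean_a, sd_a, mean_b, sd_b):
    """P(item A is chosen over item B) for independent normal affective values."""
    # The difference A - B is normal with mean (mean_a - mean_b)
    # and variance (sd_a**2 + sd_b**2).
    z = (mean_a - mean_b) / math.sqrt(sd_a ** 2 + sd_b ** 2)
    # Standard normal cumulative distribution, written with the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Invented scale values for two well-liked items.
p_beef = prob_prefer(1.9, 1.0, 1.7, 1.0)
print(round(p_beef, 3))
```

When both items fall in the top categories of the schedule, their scale values and dispersions are estimated from very few category boundaries, which is one way to read the remark about coarse grouping.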

Some of these experiments deal with 
rather trivial values while others deal 
with socially more important values, but 
our principal concern here is in the de- 
velopment of those scientific methods 
which can be adapted over a wide range 
of values whether they be socially im- 
portant or trivial. 

We turn next to the application of 
psychophysical theory to some experi- 
mental problems in economics. For a 
long time there has been considerable 
interest in the measurement of utility, 
but the measurements have generally 
been indirect. Psychologists have been 
able to measure utility experimentally 
for over two decades, but economists 
have not until very recently expressed 
interest in these methods. In the last 
few years there seems to have been a 
marked change in the attitude of econ- 
omists to these problems. In principle, 
utilities can be measured for an indi- 
vidual subject, but it is easier experi- 
mentally to apply these methods to the 
measurement of utility for a group of 
subjects. Psychophysical theory lends 
itself well to a number of variations in 
the measurement of utility. For exam- 







ple, the utility of a purchase can be 
described as the algebraic sum of the 
utilities of the object and of the price. 
In this case, the utility of the object 
would presumably be positive, whereas 
the utility of the price would be nega- 
tive. The question then arises about 
the location of a rational zero point for 
the scale of utility. An experiment is 
now in progress to demonstrate an ex- 
perimental procedure for locating the 
zero point in the scale of utility. It 
seems reasonable that the prices of vari- 
ous competing objects should be checked 
with their utilities to ascertain for any 
specified population to what extent some 
objects are overpriced or underpriced. 
Survey methods are available for doing 
these things. In determining the zero 
point for the scale of utility, we are 
asking several hundred subjects to ex- 
press their preferences among various 
objects that might be given to them as 
birthday presents. Each of these sin- 
gle objects will then be given a value 
on the scale of utility. In addition to 
these judgments, we also asked the sub- 
jects to make a number of different 
judgments. We asked them whether 
they would prefer to receive gifts A 
and B or C. In this case they must 
judge whether the satisfaction from A 
and B is greater or less than the antici- 
pated satisfaction from the single birth- 
day present C. By judgments of this 
sort we expect to be able to locate the 
zero point of utility because the affec- 
tive value of A and B combined should 
equal the sum of the utilities for these 
two objects taken separately. Within 
the range of the experiment with a small 
number of different objects to be se- 
lected, an additive theorem can be as- 
sumed to hold reasonably well. Dimin- 
ishing returns would probably not be 
noticeable within the choice of four or 
five different objects. 
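
The logic of locating the zero point can be made explicit with a short sketch. It assumes that the additive theorem holds exactly and that the scale values of single gifts and of pairs are expressed on one common scale with an arbitrary origin. The function name `zero_point` and all numerical values are hypothetical.

```python
def zero_point(single, combined):
    """Estimate where the rational zero lies on an arbitrary-origin scale.

    single:   {object: scale value measured from the arbitrary origin}
    combined: {(a, b): scale value of receiving a and b together}
    If utility is u = s - z and utilities add, then
        s_ab - z = (s_a - z) + (s_b - z),  so  z = s_a + s_b - s_ab.
    Each pair gives one estimate of z; the estimates are averaged.
    """
    estimates = [single[a] + single[b] - s_ab
                 for (a, b), s_ab in combined.items()]
    return sum(estimates) / len(estimates)

# Invented scale values for three gifts and three pairs of gifts.
single = {"A": 6.0, "B": 7.0, "C": 8.0}
combined = {("A", "B"): 8.0, ("A", "C"): 9.0, ("B", "C"): 10.0}

z = zero_point(single, combined)
utilities = {name: s - z for name, s in single.items()}
print(z, utilities)
```

With real data the separate estimates of z would not agree exactly, and their scatter would give some indication of how well the additive assumption holds within the range of the experiment.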

In making these adaptations of psy- 
chological measurement theory to eco- 
nomics, one naturally wonders whether 
economics could be developed as an 
experimental science. Although I am 
not an economist, it has seemed to me 
entirely feasible that economics should 
be developed as an experimental science. 
In discussing this question with some 
of my friends in economics, I find that 
they are divided. Some of them insist 
emphatically that economics can never 
be an experimental science, while others 
are equally certain that this is possible. 
As an example we might consider the 
indifference function in economic the- 
ory. An indifference curve can be con- 
sidered as a curve showing the combina- 
tions of two commodities X and Y 
which have the same utility value. If 
the amounts of the two commodities 
are considered to be the x and y axes 
in a three-dimensional model, then util- 
ity can be considered as the ordinates 
which are perpendicular to the x-y 
plane. An indifference curve would 
then be a horizontal section parallel 
to the x-y plane which represents con- 
stant utility. For different values of 
utility we would then have sections at 
different elevations which give a fam- 
ily of indifference curves. It has been 
shown that these indifference curves can 
be determined experimentally. There 
are many situations of controlled econ- 
omies where the shapes of these func- 
tions can be studied experimentally. 
Such situations are in occupied coun- 
tries or in prisons and in other situations 
with central control of prices. By al- 
tering the price of a commodity, the 
changes in the indifference curves can 
be noted experimentally. 
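
The geometric description of the indifference curve as a horizontal section of the utility surface can be illustrated numerically. The utility function below is purely an assumption chosen for convenience (a Cobb-Douglas form with exponent `a`); nothing in the experiments fixes this shape, and the function names are hypothetical.

```python
def utility(x, y, a=0.5):
    """Hypothetical utility surface over amounts x and y of two commodities."""
    return (x ** a) * (y ** (1.0 - a))

def indifference_curve(level, xs, a=0.5):
    """Points (x, y) on the horizontal section of the surface at a given level."""
    # Solve utility(x, y) = level for y at each amount x.
    return [(x, (level / x ** a) ** (1.0 / (1.0 - a))) for x in xs]

# One member of the family of curves: all combinations with utility 2.
curve = indifference_curve(2.0, [1.0, 2.0, 4.0])
for x, y in curve:
    print(x, y, utility(x, y))
```

Repeating the call for several utility levels gives the family of curves at different elevations described above.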

As a final example of the adaptation 
of psychophysical theory in the meas- 
urement of values, we shall consider the 
field of esthetics. If esthetics were to 
be regarded as a purely normative sci- 
ence, then we should expect the esthetic 
value of an object to be determined by 
its physical properties. Such an inter- 







pretation seems well-nigh hopeless. It 
seems much more fruitful to recognize 
that the esthetic value of an object is 
determined entirely by what goes on in 
the mind of the percipient. In this 
manner of looking at the problem we 
deal again with values that are subjec- 
tive experiences and which may vary 
from one person to another and cer- 
tainly from one culture to another. An 
esthetic object symbolizes human emo- 
tional experience and its resolution in 
a conceptual and abstract manner. Ex- 
cept in extreme cases the esthetic ex- 
perience is not itself emotional. It is 
essentially an abstraction. There is 
nothing absolute about the value of an 
esthetic object. The esthetic value is 
determined by the experience and the 
attitudes of the observer. 

Some time ago I attended a series of 
seminars on esthetics at the home of 
one of my colleagues. Most of the par- 
ticipants in that seminar were from the 
humanities and the arts. The seminars 
were devoted to discussions about the 
theory of esthetics. In some of those 
discussions it occurred to me that the 
question at issue could be treated as a 
question of experimental fact, and I 
ventured to suggest how the psycho- 
physical methods could be adapted to 
obtain an empirical answer to the ques- 
tion at issue. It was an illuminating 
experience to discover that some of my 
friends in the humanities were hostile 
to the very idea of subjecting questions 
of esthetic theory to empirical inquiry. 
On one of those occasions a friend 
showed me a quotation from Aristotle 
that settled the matter for him. It was 
heresy when I suggested that we knew 
more about this problem than Aristotle. 
Artists are sometimes suspicious of the 
experimental study of artistic prefer- 
ences, and perhaps with some reason. 
Sometimes experimental studies are 
made in esthetics when the investi- 
gator is interested in secondary effects 
rather than in the esthetic experience. 
On the other hand, I have found some 
artists who are very much interested in 
such inquiry. A friend who is a por- 
trait painter frequently encouraged ex- 
perimental studies of this kind at the 
Art Institute in Chicago. Unfortu- 
nately I have not been able to induce 
many students of psychology to study 
experimental esthetics. 

In closing I should like to comment 
briefly on the social studies as science. 
It is unfortunate that the social studies 
have rather low prestige among the sci- 
ences. I believe that this is what we 
should expect because a large number 
of researchers in the social studies have 
not adopted the impartial, objective, 
and intellectual attitudes of science. 
Quite generally in these fields the writ- 
ers argue for social action of some kind, 
about the right and wrong ways of life, 
about what is good and what is evil in 
the opinions of the writers, about the 
good and the bad names and categories 
for describing their political friends and 
enemies. It is still true that social sci- 
entists rather frequently fail to study 
social phenomena as science to identify 
the forces at work without name call- 
ing and without injecting their own 
value judgments into what they are de- 
scribing. As long as social scientists 
fail to distinguish between propaganda 
and science they will have low prestige 
among the sciences. 


SUMMARY 


This paper has been concerned with 
the problems of a subjective metric. 
Social studies do not need to be quan- 
titative in order to qualify as science. 
Some of the most important experiments 
in science deal first of all with the de- 
scription of basic phenomena in a quali- 
tative way. It usually happens that 
quantitative methods appear with more 
intensive study. Here we have con- 







sidered some exploratory attempts to 
establish a subjective metric for the 
measurement of values. I have not 
succeeded in persuading social science 
students about the fascinating challenge 
to develop their field as science. To 
do so, we must free ourselves from the 
impulse for social action which has no 
place here. We should avoid problems 
in which we have an axe to grind. As 
citizens we have the privilege and the 
duty to participate in political elections. 
But when we work as scientists we 
should be aloof from the issues of the 
moment and from the chatter of the mar- 
ket place. Only in scientific detachment 
and objectivity can we eventually be 
helpful in developing the social studies 
as science. 


(Received April 8, 1953) 








Psychological Review 
Vol. 61, No. 1, 1954 


A NEURAL MODEL FOR SIGN-GESTALT THEORY 1 


JAMES OLDS 


Harvard University 


Whether we like it or not, a theory 
of learning points two ways. In one 
direction it points to better experiments. 
In the other direction it points to a 
model that would reproduce the aspect 
of behavior which the theory is used to 
explain; it is the unfinished blueprint 
for such a model. 

It is not so readily understood that 
the first pointing depends on the second: 
the theory must point to a model in 
order to point to better experiments. 
Quite often, because this is not under- 
stood, a further implication is over- 
looked, namely, the more nearly fin- 
ished the blueprint, the better the ex- 
periments will be. I will try to justify 
this proposition briefly in the next para- 
graph, but first I would like to empha- 
size its main consequence for the pres- 
ent discussion. This is that “mechani- 
cal” or “neural” models are superior to 
merely “conceptual” ones because they 
do provide a more nearly finished blue- 
print. They tell us not only the type 
of relations that must occur, but the 
type of material in which these relations 
must occur, and how the relations can 
be built into this kind of material. 

The advantage of the completely 
specified model or mechanism would be 
to allow synthetic reproduction of the 
phenomenon under investigation. Syn- 


1 This paper is based on portions of a dis- 
sertation submitted to Harvard University in 
partial fulfillment of the requirements for the 
Ph.D. degree in social psychology. The work 
was supported by a Research Training Fel- 
lowship of the Social Science Research Coun- 
cil and by funds of the Laboratory of Social 
Relations at Harvard. The writer wishes to 
express his appreciation to Professor R. L. 
Solomon for his many helpful criticisms and 
suggestions. 


thetic reproduction gives the ideal solu- 
tion to the main scientific problem: it 
apportions the variance of the phenom- 
enon under investigation to the various 
causal constituents with no variance left 
over and not one too many causal con- 
stituents. Thus, it selects from the mul- 
titude of conditions that surround any 
phenomenon precisely the complex in- 
gredients that are necessary to produce 
the phenomenon. In so doing, it gives 
the basis for a descriptive language that 
will not be crowded with irrelevant con- 
cepts, nor lacking in crucial ones, but 
rather will have just one concept for 
each important variable and none left 
over. 

This would be the advantage of a 
completely specified model; the nearer 
we approach the completely specified 
model, the more we approach these ad- 
vantages. Thus, it is to our advantage 
to get more specifications into the un- 
finished blueprint for the model. I be- 
lieve the further implication is that an 
approach toward a mechanical model 
will always be beneficial. 


THE ADEQUACY OF THE MODEL 
TO THE DATA 


A model may fail, however, in either 
of two directions. On the one hand, it 
may be so incompletely specified as to 
fail to provide an adequate descriptive 
language and to carve out crucial vari- 
ables. On the other hand, it may be 
more or less completely specified, but 
fail to reproduce the phenomenon under 
investigation. 

My contention is that Hull’s model 
(2, 3) is more completely specified than 
Tolman’s (6); in this sense Hull has 
the edge. Tolman, on the other hand, 







presents a model that seems to repro- 
duce more adequately the phenomena of 
learning and performance that are the 
subject matter of both theories; in this 
sense Tolman has the edge. I want to 
consolidate their gains. 

My purpose in the present paper, 
therefore, is to set forth a more com- 
plete blueprint for the model which Tol- 
man has presented. I will do this by 
giving a neural interpretation of Tol- 
man’s theory based in large part on 
Hebb’s (1) discussion of the properties 
of cell assemblies. 


ADVANTAGES OF THE MODEL 


As the proof of the pudding must be 
in the eating and not in any compli- 
cated rationalization, I will suggest at 
the end of this paper some of the 
advantages produced by the additions 
which I make to the Tolman theory. 
These come under three headings: (a) 
resolution of the problem of latent 
learning, (b) the stimulus control of 
ideas, and (c) the growth of approach 
motives. As it would do no good to 
expand on advantages before we have 
the theory, we proceed immediately to 











Fig. 1. The cell assembly described by Hebb (1, p. 73) 


an introduction of the various impor- 
tant points of the model. 


Hebb’s Cell Assembly 


The cell assembly described by Hebb 
(1) is most simply conceived as a three- 
dimensional lattice of neural paths pro- 
viding several complete circuits, and al- 
ternative paths from each junction point 
so that when an impulse finds one of 
the transmission units refractory, an- 
other path allows the impulse to stay 
alive within the system. Therefore, the 
system has the capacity to reverberate. 
The assembly is most easily understood 
on the basis of the diagram in Fig. 1 
borrowed from Hebb (1, p. 73). Each 
of the arrows in the diagram represents 
a single transmission unit, a single path- 
way. Although these are not considered 
by Hebb to be individual neurones, but 
rather low-order systems of neurones, 
we will take them to be the lowest order 
of functional units for our present ex- 
planation. Each pathway is refractory 
for a moment after an impulse has trav- 
ersed it. Therefore, without alternative 
pathways reverberation would quickly 
die out, for the impulse would come 
back a second time before a pathway 
could recover. Each cell assembly con- 
sists in a number of these paths; the di- 
agram represents a cell assembly. From 
the diagram, we can see how alternative 
pathways make reverberation possible. 
The impulse enters along the pathway 
marked 1,4, it proceeds to 2,14, and 
then through 3,11 and 1,4 again. At 
this point, it finds 2,14 refractory, but 
there is an alternative path, 5,9. The 
impulse proceeds around according to 
the numbers and is allowed to stay alive 
within the system because neither all 
the pathways, nor too many of them 
are refractory at the same time. 
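
The role of alternative pathways can be illustrated with a toy simulation. It is a sketch only: the two small networks below are invented for the purpose and are not the lattice of Fig. 1, time is divided into discrete steps, and the refractory period of four steps is an arbitrary assumption. The function name `reverberate` is hypothetical.

```python
def reverberate(paths, start, refractory=4, max_steps=100):
    """Count how many pathways an impulse traverses before it dies out.

    paths:      {junction: [next junctions]}, each pair being one pathway.
    refractory: number of time steps a pathway stays unusable after use.
    The impulse takes the first outgoing pathway that has recovered; it
    dies when every pathway out of its current junction is refractory.
    """
    last_used = {}
    node, steps = start, 0
    for t in range(max_steps):
        for nxt in paths.get(node, []):
            used = last_used.get((node, nxt))
            if used is None or t - used > refractory:
                last_used[(node, nxt)] = t
                node, steps = nxt, steps + 1
                break
        else:
            return steps  # no recovered pathway: activity ceases
    return steps  # still alive when the simulation was stopped

# A single closed circuit, and the same circuit with one alternative loop.
single_loop = {0: [1], 1: [2], 2: [0]}
two_loops = {0: [1, 3], 1: [2], 2: [0], 3: [4], 4: [0]}

print(reverberate(single_loop, 0))  # dies on returning to the first pathway
print(reverberate(two_loops, 0))    # survives for the whole run
```

In the single circuit the impulse returns to its first pathway before that pathway has recovered, and activity stops. With one alternative loop the impulse alternates between the two circuits, so each pathway has time to recover before it is needed again.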

Hebb’s cell assembly as it stands 
has five properties that we should note 
before we proceed. The first is rever- 







beration. When an impulse enters the 
assembly it can reverberate without fur- 
ther stimulation. Second, the cell as- 
sembly has relations to other internal 
assemblies so that it can be aroused by 
central facilitation. Third, it has rela- 
tions to the peripheral receptors so that 
it can be aroused by the environment. 
Fourth, it tends to have behavioral out- 
lets, that is, it tends to control behaviors 
while it is aroused. Fifth, it has at least 
two states or phases: it can be latent 
when it is not aroused, and it can be 
in a state of reverberation when it is 
aroused. 


THE FOUR PHASES OF IDEAS AND WANTS 


At this point we turn to the aspects 
of behavior that are to be explained by 
Hebb’s construct. There are two enig- 
matic terms avoided by S-R psycholo- 
gists and often by cognitive psycholo- 
gists because they seem so subjective 
and unfathomable, and so particularly 
refractory to mechanical analysis. These 
are “ideas” and “wants.” 2 No psychol- 
ogy is lacking a set of euphemisms for 
these terms, but few psychologies han- 
dle the problems well. S-R psychology 
speaks of “fractional components” in- 
stead of “ideas,” and of “antedating 
goal reactions” instead of “wants.” 
Tolman faces the problem with less cir- 
cumlocution: he speaks of the “expec- 
tancy” or the “significate” instead of 
speaking of the “idea.” And he speaks 
of the “readiness” or the “demand” in- 
stead of the “want.” 

If, instead of searching for better and 
more satisfactory euphemisms, we take 
the terms as they stand with their more 
or less obvious, everyday meanings, and 
ask what we know about them, we find 


2 By the term “want” at this point, I refer 
to more than the basic physiological drives 
that underlie some (but not all) of behavior. 
Instead, I refer to the specific conceptualiza- 
tion of a goal that seems to precede most 
goal-directed activity in a human being. 


that we know quite a lot more than we 
might expect. And we also find that 
there is an interesting parallelism be- 
tween an analysis of ideas and an analy- 
sis of wants that suggests that they are 
not such different things as they might 
seem at first glance. 

The present analysis is going to be 
quite cursory and gross, for it is only 
to prepare the way for the model which 
is to come; it is to give some meaning- 
ful anchorage points for the technical 
material that is to follow. 

First, there are various phases found 
in the analysis of a single idea. Let us 
take as an example the idea of a red 
light (of the traffic control variety). 
At first the red light is seven or eight 
blocks up the road, and we are not even 
thinking about it. I will say that the 
“idea of the red light” is latent at this 
point. After a few moments, we are 
approaching the intersection, the light 
at the corner turns from green to yel- 
low, and for a very little while we are 
thinking about the red light and expect- 
ing it, but we are not seeing it. I will 
say the idea of the red light is now in 
a state of expectancy. But then the 
light turns red and we are seeing a red 
light. I will say the idea of the red 
light is in a state of perception. After 
we have sat behind the light for what 
seems like an interminable number of 
seconds, we become fed up with it, it 
seems to be lasting forever. I will say 
the idea is in a state of boredom. Fi- 
nally, the light turns green, we drive 
on and forget it. The idea of the red 
light is latent again. It is obvious that 
an idea has at least four distinct con- 
ditions or phases: (a) it is not even 
thought, (b) it is thought but not seen, 
(c) it is seen, and (d) it is palling. 
The second condition can be divided 
again and again; when the idea is 
thought but not seen, it can be a mere 
thought, an expectancy, a memory, and 
so forth, but we will ignore these finer 







gradations in the present paper. For 
our purposes, the idea has four phases 
which we may call latency, thought, per- 
ception, and boredom. 

The most interesting thing about 
these four phases is that they are ex- 
actly and obviously paralleled by the 
four phases of a want. At first the 
want is latent; as for example when I 
am not thinking about food, and I do 
not want it. Next, something makes 
me think of food, and I notice that I 
am hungry. I start doing things that 
will get me fed; the want is now in a 
state of motivation. After that, I am 
being fed. The want is in a state of 
gratification. Finally, I am too full, 
and the want is in a state of satiation. 
After I have waited for a while, the 
satiation disappears, and the want is 
latent again. Thus, the want has four 
phases which we may call latency, mo- 
tivation, gratification, and satiation. 
Note how closely these fit the phases 
of the idea. 

From the parallelism, one would be 
tempted to suggest that ideas and wants 
are much the same sort of things. I 
suggest that they are not distinguished 
as far as the kind of structure is con- 
cerned, but only in terms of some power 
or “motive force” parameter. That is, 
an idea is a concept with a low motive 
force; a want is a concept with a high 
motive force. 


THE FOUR PHASES OF THE CELL 
ASSEMBLY 


The cell assembly, as we left it a 
few paragraphs back, has only two 
phases, latency and arousal. The state 
of arousal is a state of reverberation; 
an impulse enters the system along one 
pathway and reverberates within the 
system without further stimulus sup- 
port. 

We would like to find some character- 
istic of the cell assembly, implicit in 
Hebb’s description of it, to allow us to 
ascribe it four phases and thus use it 
as an adequate model for the ideas and 
wants we have just described. Par- 
ticularly, we would like to find two dif- 
ferent conditions of arousal, one cor- 
responding to perception or gratification 
and the other corresponding to thought 
or motivation. Analyzing these two 
phases, we find that the thought-moti- 
vation phase is characterized by a mini- 
mum of external stimulus support: it is 
a more or less autonomous internal re- 
verberation, and it does not seem to 
be terminated either of its own accord 
or by mere withdrawal of the arousing 
stimulus. Rather, this phase of expec- 
tancy or motivation is terminated by 
the presentation of the goal object in 
the environment. 

The perception-gratification phase, on 
the other hand, is characterized by a 
maximum of external stimulus support: 
it is not an autonomous internal rever- 
beration, it does seem to become sati- 
ated or refractory of its own accord, 
and it seems to go out immediately 
upon withdrawal of the arousing stimu- 
lus. This phase is the perception or 
enjoyment that is turned on by the goal 
object in the environment. 

Our problem is this: How can the 
same idea participate in an expectancy 
which is terminated by the goal object, 
and in a perception which is turned on 
by the goal object? The same idea 
seems to be turned off and on by the 
same object, which sounds ridiculous. 

I find the answer to this question 
in Hebb’s discussion of the conditions 
necessary for reverberation, an answer 
which shows that Hebb’s cell assembly 
is a much better model for ideas and 
wants than one might expect from a 
superficial glance. 

You will remember that in our de- 
scription of the cell assembly, we said 
it would reverberate because neither all 
the pathways, nor too many of them 










are refractory at the same time. At the 
present point, this assertion becomes 
crucial. We may suggest that any stim- 
ulus which has a single or small number 
of connections with a given cell assem- 
bly would start a reverberation (a 
thought or motivation process in the 
assembly). A stimulus which has a 
large number of connections to many of 
the different pathways, on the other 
hand, would not set up a reverberation; 
instead it would “fire” the assembly. 
All pathways would be rendered refrac- 
tory (or relatively refractory) at the 
same time. In the continued presence 
of the strong external stimulation the 
activity of the assembly could be main- 
tained. But upon withdrawal of the
external stimulus the assembly would
be refractory, and activity would cease. 

For our purposes, then, the cell as- 
sembly has four phases or conditions. 
We will say it can be in a state of la-
tency, in a state of reverberation, in a
state of firing, and in a state of refrac-
toriness. These correspond to the four
phases of ideas and wants. For the 
first phase we have used in all cases the 
term latency. For the second phase, 
we render equivalent the terms thought, 
motivation, and reverberation. For the 
third phase the equivalencies are per- 
ception, gratification, and firing. For 
the fourth phase the terms are boredom, 
satiation, and refractoriness. The cell 
assembly is our mechanical model for 
ideas and wants. A cell assembly of 
low “motive force” is an idea; a cell 
assembly of high “motive force” is a 
want. We will go on now to a simpli- 
fied discussion of association. 
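
The four phases just listed can be read as a small state machine. The following is only a minimal sketch of that reading, not a formalism given in the text: the names `Phase`, `Stimulus`, and `step` are hypothetical, the split of stimulation into "weak" and "strong" stands in for "a small number" versus "a large number" of connections, and the recovery from refractoriness back to latency is an assumption the text does not spell out.

```python
from enum import Enum


class Phase(Enum):
    LATENCY = "latency"
    REVERBERATION = "thought / motivation"
    FIRING = "perception / gratification"
    REFRACTORINESS = "boredom / satiation"


class Stimulus(Enum):
    NONE = 0    # no stimulation reaches the assembly
    WEAK = 1    # one or a few connections: can start a reverberation
    STRONG = 2  # many connections: "fires" the whole assembly


def step(phase, stimulus):
    """Return the next phase of a cell assembly (illustrative reading only)."""
    if phase is Phase.REFRACTORINESS:
        # Assumption: a refractory assembly simply recovers to latency.
        return Phase.LATENCY
    if stimulus is Stimulus.STRONG:
        # Strong stimulation fires the assembly; this also ends reverberation,
        # and continued strong stimulation maintains the firing.
        return Phase.FIRING
    if phase is Phase.FIRING:
        # Withdrawal of the strong stimulus leaves every pathway refractory.
        return Phase.REFRACTORINESS
    if phase is Phase.REVERBERATION:
        # Reverberation persists without external stimulus support.
        return Phase.REVERBERATION
    # Latent assembly: a weak stimulus starts a reverberation.
    return Phase.REVERBERATION if stimulus is Stimulus.WEAK else Phase.LATENCY
```

For example, `step(Phase.REVERBERATION, Stimulus.NONE)` stays in reverberation, whereas `step(Phase.FIRING, Stimulus.NONE)` moves to refractoriness, which mirrors the asymmetry between the two active phases described above.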


THE ASSOCIATION OF IDEAS


Again we turn to the aspect of behav- 
ior that is to be explained, and again 
we find a phenomenon which is rarely 
treated in contemporary psychology ex-
cept with careful circumlocution. This
is the association of ideas which is pro-
duced within a human being by a suc-
cession of stimuli in the environment. 
Each of us knows from his own ex- 
perience a great deal about the way 
an associational link between two ideas 
functions, but we do not often analyze 
the functioning carefully enough to be 
aware of its essential characteristics. 

I will take a simple example to make 
these characteristics explicit. Our sub- 
ject is unacquainted with his typewriter. 
The carriage is far to the right, and 
he perceives and pushes a key marked 
“Tabular.” The carriage jumps five 
spaces to the left and stops in a new 
position. First, there is an antecedent 
situation; then he makes a response and 
an outcome ensues. The antecedent sit- 
uation is the carriage far to the right 
plus the perception of the tabular key; 
we will call this A. The response is to 
push the tabular key; we will call this 
R1. The outcome is the carriage five
spaces to the left; we will call this B.
Thus, in the presence of A, R1 leads to
B. The A-R1-B learning sequence has
taught our subject that A followed by
R1 leads to B. We may say there is
now an association of the A idea through
R1 to the B idea. In the future, if B
is wanted, and A is presented, our sub-
ject will perform R1. Also, if A is pre-
sented and R1 should occur by accident,
our subject will expect, and prepare for
B. That is, if he wants the carriage
moved from its A to its B position, he
will now press the tabular key. And
if he inadvertently presses the tabular
key, he will expect and quite likely take
some action to offset the movement of 
the carriage to its B position. 

After a certain response in the pres- 
ence of A has led to B, we say that some 
idea of A is associated with some idea 
of B. But the facts of behavior are 
these: (a) in the future if we make this 
particular response to A we will antici-
pate or expect B. (b) In the future if
we should happen to want B we would
show some tendency to search out A 
and then to make this particular re- 
sponse that takes us from A to B. The 
perception of A now arouses some ex- 
pectancy of B, and the motivation of 
B induces motivation of A. The link 
seems to carry expectancy in the A to
B direction, and motivation in the B 
to A direction. This will become clearer 
now as we lay out the specifications for 
our model in detail. 


A NEURAL MODEL FOR SIGN-GESTALT 
THEORY 


There are two undefined structural 
units of the model. These are the cell 
assembly and the response control unit. 
We presume at the outset that for any 
stimulus with which the subject has 
repeated commerce, a cell assembly be- 
comes established; thereafter, the stimu- 
lus is an unconditioned stimulus of the 
cell assembly. Further, we presume 
that for any response which becomes
organized within the behavior repertory
of a subject, a response control unit be-
comes formed; thereafter, the response
is elicited by the activation of the re-
sponse control unit. These two forma-
tive processes may occur at first more or
less by chance; the rules of organization 
and growth given below will show how 
selectivity can be introduced after a 
chance generation of these structural
units.

In the exposition, cell assemblies will
be designated by the lower-case letters
of the early part of the alphabet, e.g.,
a, b, c. Response control units will be
designated r1, r2, r3, and so forth. Stim-
uli in the environment will be designated
by the upper-case letters of the early
part of the alphabet, e.g., A, B, C. Re-
sponses will be designated R1, R2, R3,
and so forth.

The definitions or specifications and
postulates are listed below.

I. The unconditioned stimulus. Each 
cell assembly has a stimulus threshold 
of firing which needs to be crossed by 
stimulation from the environment. A 
stimulus which crosses this threshold is 
an unconditioned stimulus of the assem-
bly. The unconditioned stimulus of as-
sembly a is A, that of b is B, and so
forth. 

II. The conditioned stimulus. Each 
cell assembly has a stimulus threshold 
of reverberation which needs to be 
crossed by stimulation from the environ- 
ment (mediated by antecedent assem- 
blies as noted in VII below). A stimu- 
lus which crosses this threshold is called 
a conditioned stimulus of the assembly. 

III. The motive threshold. Each cell 
assembly has a motive threshold which 
must be crossed by combined positive 
motive force or combined negative mo-
tive force (see VIIIa and b below).

IV. Intrinsic motive force. Each cell 
assembly has an intrinsic positive mo- 
tive force and an intrinsic negative mo- 
tive force which contribute toward com- 
bined positive and negative motive force 
respectively (and toward the combined 
motive forces of its antecedents when it 
is reverberating, see VIII below). Thus, 
there are two separate force parameters 
of each cell assembly; it is as though 
there were a solution with two sepa- 
rately variable factors dissolved. 

V. The law of assembly activation. 
Both the motive threshold and one of 
the stimulus thresholds must be crossed 
at the same time for the assembly to 
become aroused (i.e., to fire or rever- 
berate). If the motive threshold is 
crossed, then: 

(a) the assembly will fire if the 
stimulus threshold of firing is crossed. 
Arousal ceases upon termination of this 
stimulus. 

(b) the assembly will reverberate if
the stimulus threshold of reverberation
is crossed (unless both stimulus thresh-
olds are crossed, in which case the as-
sembly will fire). Reverberation con-
tinues after withdrawal of this stimulus;
it is terminated by firing.

VI. The learning law of association. 
Two cell assemblies become related to 
one another by an associational rela- 
tion under the following circumstances. 
If a fires, and then r1 is activated, and
then b fires, an associational relation
will be formed between a and b which
passes through the response control unit
r1. The cell assembly a will become the
antecedent of the associational relation,
and the cell assembly b will become the
successor of the associational relation.
They will be connected with one an-
other through r1. It is as though there
were a wire connecting two terminal
boxes a and b passing through a junc-
tion box r1; and certain characteristics
of the flow across the wire determine
what will happen in the junction box
(see IX below). The associational rela-
tion will be strengthened by further fir-
ings of a followed by activation of r1
and firing of b. It will be weakened by
further firings of a followed by activa-
tion of r1 when these are not followed
by firings of b.

VII. The law of conditioned stimuli. 
In the future, the firing of the ante- 
cedent will be a conditioned stimulus 
for the successor (see II and Vb above). 

VIII. The law of the backflow of mo- 
tive force. In the future, the rever- 
berating of the successor will add two 
components of motive force, instrumen- 
tal positive motive force and instru- 
mental negative motive force, to the 
antecedent; these contribute toward re- 
spective combined positive and negative 
motive forces of the antecedent. A re- 
verberating successor adds these com- 
ponents not only to the antecedent, but 
through the antecedent to further ante- 
cedents; the intervening assemblies need 
not be aroused for this transmission to 
continue to further antecedents. 

(a) The combined positive motive
force of an assembly is equal to the sum
of its intrinsic positive motive force and 
its instrumental positive motive force. 
Similarly, combined negative motive 
force is equal to the sum of intrinsic 
and instrumental negative motive force. 
Either the combined positive motive 
force or the combined negative motive 
force of an assembly must be above the 
motive threshold in order for the as- 
sembly to become activated (see III 
above). 

(b) The instrumental motive force
(positive or negative) which a rever-
berating successor delivers to a near or
distant antecedent is: (i) directly pro-
portional to the combined motive force
of the successor, (ii) directly propor-
tional to the strength of the weakest
link in the chain of associational rela-
tions between them, and (iii) inversely
proportional to the number of assem-
blies interpolated between them.

IX. The law of performance. The 
likelihood of a response R1 depends
on the amount of facilitation and the
amount of inhibition contributed to the
response control unit r1. Facilitation
and inhibition are contributed to a re-
sponse unit r1 only when its antecedent
a is firing and its successor b is rever-
berating. If a is firing and b is rever-
berating, then:

(a) Facilitation will be contributed 
to r1 in proportion to the amount of the
difference between the combined motive
force of the antecedent and the com-
bined motive force of the successor if
this difference is favorable to the suc-
cessor. Therefore, (i) if the successor
is less negative than the antecedent, the
response will be facilitated; (ii) if the
successor is more positive than the ante-
cedent, the response will be facilitated.

(b) Inhibition will be contributed to
r1 in proportion to the amount of the
difference between the combined motive
force of the antecedent and the com-
bined motive force of the successor if
this difference is favorable to the ante-
cedent. Therefore, (i) if the successor
is more negative than the antecedent,
the response will be actively inhibited;
(ii) if the successor is less positive than
the antecedent, the response will be ac-
tively inhibited.

(c) If the facilitation is greater than 
the inhibition, then the response control 
unit r1 will be activated, and R1 will oc-
cur. If the inhibition is greater than the
facilitation, then r1 will not be activated,
and R1 will not occur.

X. The law of motive growth and de- 
cline. The intrinsic positive or nega- 
tive motive force of an assembly grows 
and declines as a function of variables. 
I suggest the following postulates as a 
program for research. 

(a) The intrinsic positive or nega- 
tive motive force of an assembly is a 
joint, direct function of the number of 
transmission units in the assembly (see 
b below) and the amount of positive or
negative motive force internal to each 
transmission unit (see c, d, e below). 
Each transmission unit has both posi- 
tive and negative motive force internal 
to it. 

(b) The number of transmission 
units in an assembly tends to increase 
in proportion to the amount of time 
the assembly spends in a state of firing. 

(c) The amount of positive or nega- 
tive motive force internal to each trans- 
mission unit in the assembly tends to 
decrease in proportion to the amount of 
time that the assembly spends in firing. 

(d) The amount of positive or nega- 
tive motive force internal to each trans- 
mission unit in the assembly tends to 
increase in proportion to the amount of 
time that the assembly spends in rever- 
beration. 

(e) The rate of positive or negative 
motive growth during reverberation (see 
d above) will increase as a function of 
the combined positive or negative mo-
tive force of the assembly during the
period of reverberation. 

(f) The rate of positive or negative 
motive decline during firing (see c 
above) will decrease as a function of 
the intrinsic positive or negative motive 
force of the assembly during the period 
of firing. 
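
Postulates V, VIII(b), and IX above lend themselves to a compact computational reading. The sketch below is one such reading under loud assumptions, not the model's authoritative statement: the function names and the constant `k` are hypothetical; positive and negative motive force are collapsed into a single signed number for brevity; a threshold counts as "crossed" when it is strictly exceeded; and "inversely proportional to the number of assemblies interpolated" is taken as division by one plus that number, so that an adjacent antecedent remains well defined.

```python
def arousal(motive_force, motive_threshold, uncond_present, cond_present):
    """Postulate V: return 'fire', 'reverberate', or None for one assembly."""
    if motive_force <= motive_threshold:
        return None             # motive threshold not crossed: no arousal
    if uncond_present:
        return "fire"           # Va; firing also wins when both stimuli occur
    if cond_present:
        return "reverberate"    # Vb
    return None


def instrumental_force(successor_force, link_strengths, k=1.0):
    """Postulate VIIIb: force a reverberating successor delivers backward.

    link_strengths holds the strength of each associational relation in the
    chain, so len(link_strengths) - 1 assemblies lie between the two ends.
    """
    interpolated = len(link_strengths) - 1
    weakest = min(link_strengths)
    # Assumed functional form: k * force * weakest link / (1 + interpolated).
    return k * successor_force * weakest / (1 + interpolated)


def response_occurs(antecedent_force, successor_force,
                    antecedent_firing, successor_reverberating):
    """Postulate IX for a single a-r1-b relation, using signed net forces."""
    if not (antecedent_firing and successor_reverberating):
        return False            # nothing is contributed to r1 at all
    difference = successor_force - antecedent_force
    facilitation = max(difference, 0.0)   # difference favors the successor
    inhibition = max(-difference, 0.0)    # difference favors the antecedent
    return facilitation > inhibition      # IXc
```

Under these assumptions, `instrumental_force(4.0, [0.5])` gives an adjacent antecedent half the successor's force, and `response_occurs(1.0, 3.0, True, True)` is true because the successor is the more positive of the two assemblies.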


INTERPRETATION OF TOLMAN’S THEORY 


We turn now to sign-gestalt theory to 
show that our mechanical model does 
give interpretation to all of its impor- 
tant points. We will first interpret the 
chief terms of Tolman’s theory; then 
we will show how the relations postu- 
lated by Tolman are inferences from our 
model. 

Perception: this is a term which is 
not accented as basic by Tolman; im- 
plicitly, however, it has a very basic 
place in his theory. For it is not the 
presence of a stimulus in the environ- 
ment which controls behavior, in the 
Tolman formulation, but the “percep- 
tion” of the stimulus by the subject. 
Perception is always selective; stimuli 
are perceived in proportion to their rele- 
vance to motives (6, p. 35). Tolman 
defines a perception as “an expectation 
of the component of a sign gestalt when 
this expectation results primarily from 
present stimuli coming then and there” 
(6, p. 452). I believe this may be para- 
phrased simply by saying a perception 
is the apprehension of an object by a 
subject when this apprehension depends 
on immediate stimulation. Our me- 
chanical analogy for perception is the 
firing of a cell assembly. It requires 
both the presentation of the uncondi- 
tioned stimulus (Va) and adequate com- 
bined motive force (V). The latter 
postulate accounts for the selectivity of 
perception. 

Demand: this term is defined by Tol- 
man as an “innate or acquired urge” to 
get to or from some given stimulus, or
some physiological quiescence or dis-
turbance (6, p. 441). Simply, this is 
a want; it is an appetite or an aversion. 
A demand in our mechanical system 
consists in either one of two states. In 
the appetite case, it consists in the re- 
verberation of an assembly whose in- 
trinsic positive motive force is sufficient 
to cross its own motive threshold; in 
this case, approach behavior will be elic- 
ited according to the law of the back- 
flow of motive force (VIII) and accord- 
ing to the law of approach performance 
(IXa, ii). In the aversion case, it con-
sists in the firing of an assembly whose
negative motive force is sufficient to 
cross its own motive threshold. In this 
case, avoidance behavior will be deter- 
mined jointly by the firing negative as- 
sembly and a less negative (or positive) 
reverberating successor. This deter-
mines behavior in the direction of the
less negative successor according to the
law of avoidance performance (IXa, i).


Sign-gestalt: this term is defined as 
the knowledge that a sign followed by 
a direction distance will lead to a sig- 
nificate, e.g., the knowledge that in the 
presence of A, R1 leads to B. Our me-
chanical analogy for the sign-gestalt is
two cell assemblies joined through a re-
sponse control unit by an associational
relation. The sign is the antecedent;
the direction distance is the response 
control unit; the significate is the suc- 
cessor. 

Sign-gestalt-expectation: this term re- 
fers to the expectation that a certain 
direction distance will lead to the sig- 
nificate; the expectation results from 
the fact that the sign is presented and 
perceived. Our mechanical analogy de- 
rives from the postulate that the firing 
of the antecedent arouses reverberation 
of the successor by the law of condi- 
tioned stimuli (VII). That is, if an 
associational relation joins a and b
through r1, then a’s firing arouses re-
verberation (expectation) of b.


Sign-gestalt-readiness: this term re- 
fers to a want for some means object 
by virtue of its instrumental relation 
to a demanded object. Our mechanical 
analogy here is the reverberation of a 
cell assembly whose intrinsic motive 
force is not sufficient to cross its own 
motive threshold. It requires reverbera- 
tion of a successor (VIII) and presen- 
tation of the conditioned stimulus of 
the assembly in question (Vb). The
reverberating successor will add a com- 
ponent of motive force to the assembly 
in question; thus the combined motive 
force of the assembly will be above 
threshold, and the conditioned stimulus 
will arouse reverberation. At this point 
the assembly in question will function 
as though it were a “demand.” How- 
ever, termination of its reverberating 
successor will terminate its own demand 
characteristics, as its instrumental mo- 
tive force supply will be cut off. 

Sign-gestalt learning: Tolman’s the- 
ory of learning is briefly the following. 
In any given training sequence, the sub- 
ject learns new sign-gestalts, depending 
on what he perceives. For example, 
first the animal is in the presence of 
stimulus A. On Tolman’s theorem of 
the selectivity of perception, the subject 
will perceive A provided that it is rele- 
vant to some present demand (6, pp. 35 
and 386). Second, the subject adopts 
a direction distance R1; that is, he per-
forms behavior R1. Third, when the
behavior is done, he is in the presence 
of stimulus B. He will perceive B pro- 
vided that it too is relevant to some one 
of his present motives. If the subject 
has perceived both the antecedent A and 
the outcome B, then a new sign-gestalt 
is learned in the performance process; 
it is that in the presence of A, R1 leads
to B. 

Implicit in this description of sign- 
gestalt learning there is a premise that 
comes into superficial conflict with Tol- 
man’s (6, pp. 343-344) attack on the
law of effect. The point is this: if the
outcome B must be perceived in order 
for learning to occur, and if perception 
is contingent on motivational relevance, 
it follows that the outcome B must be 
either a goal or an instrumentality, a 
reinforcer or a secondary reinforcer, in 
order for learning to occur. But Tol- 
man’s attack on the law of effect sug- 
gests that possibly there is no need of 
B being a reward for learning to occur 
(6, p. 343). In justice we must say that 
Tolman (6, pp. 386-387) recognizes this 
superficial conflict, but he does not ex-
plicitly resolve the confusion. Our me-
chanical model does, and thus it pro-
vides a basis for reorienting the so-called 
“latent-learning” controversy (see This- 
tlethwaite, 5) as we will show in a mo- 
ment. 


LEARNING REQUIRES REINFORCEMENT 


Our mechanical analogy for sign-ge- 
stalt learning derives from the learning 
law of association (VI). Two assem- 
blies become related by an associational 
relation if a fires, then r1 is activated,
then b fires. But the conditions for
the firing of a and b are outlined in
the law of assembly activation (V). 
Both the motive threshold and the stim- 
ulus threshold of firing must be crossed 
before firing will occur. But in order 
for the motive threshold to be crossed, 
the assembly must have either sufficient 
intrinsic motive force (in which case its 
stimulus is a reinforcer) or sufficient in- 
strumental motive force (in which case 
its stimulus is a secondary reinforcer). 
Thus, there is no learning without re- 
inforcement. 

But our model does predict latent
learning provided the B stimulus is a
reinforcing stimulus. For, a change in
the combined motive force of b can
be immediately reflected in two other
changes: (a) a change in the combined
motive force of a and (b) a change in
the likelihood of the activation of r1
while a is firing, both without any repe-
tition of the A-R1-B sequence. This
derives from the law of the backflow of
motive force (VIII) and from the law
of performance (IX). The implication
is that a change in the value of the
outcome B will change the value of the
antecedent A and the likelihood of the
response R1 to stimulus A without
any repetition of the A-R1-B sequence.
Thus, learning which was latent when
the combined motive force of b was
insufficient to evoke performance will
become evidenced when the combined
motive force of b is changed by some
operation.
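
A small numerical illustration may make this prediction concrete. It is a sketch under stated assumptions rather than a derivation from the text: `net_drive_on_r1` is a hypothetical helper, forces are treated as signed net values, and the backflow from b to a (VIII) is taken to be simply `link_strength * force_b` for an adjacent pair.

```python
def net_drive_on_r1(force_a_intrinsic, force_b, link_strength):
    """Facilitation minus inhibition on r1 while a fires and b reverberates.

    A positive result means the response is favored (IXa); a negative
    result means it is inhibited (IXb).
    """
    # Combined force of a = intrinsic force + instrumental backflow from b.
    force_a_combined = force_a_intrinsic + link_strength * force_b
    return force_b - force_a_combined


# After training: the link exists, but b is worth too little to evoke R1.
before = net_drive_on_r1(force_a_intrinsic=1.0, force_b=1.0, link_strength=0.5)

# The outcome is then revalued by some operation; no A-R1-B trial is rerun,
# so link_strength is unchanged.
after = net_drive_on_r1(force_a_intrinsic=1.0, force_b=4.0, link_strength=0.5)
```

With these illustrative numbers the net drive moves from -0.5 (inhibited) to 1.0 (facilitated) although the associational relation itself has not changed, which is the sense in which the learning was latent.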

Our suggestion vis-a-vis the rather 
large experimental program which has 
centered around the latent-learning con- 
troversy is this: experiments which suc- 
ceed in making the outcome B suffi- 
ciently neutral with respect to the pres- 
ent motivational state of the subject will 
not give evidence of latent learning. 
We may just as well stop looking for 
learning without any positive or nega- 
tive reinforcement, for in these cases 
the outcome will not be “perceived.” 

Experiments will demonstrate latent 
learning, however, whenever the out- 
come is made motivationally relevant 
in a positive or negative direction dur- 
ing learning, if the motivational rele- 
vance is reversed (as from positive to 
negative) after training without any 
further repetitions of the training se- 
quence. In these cases, there will ap- 
pear (if enough subjects are run) first- 
trial evidence of changes in response 
likelihood; such first-trial changes can- 
not be predicted by Hull’s theory. Tol- 
man and Gleitman (7) have reported 
such an experiment and it has sustained 
this prediction. 

In summary, further experiments 
should show two things: (a) after 
A-R1-B training with a reinforcing stim-
ulus B, changes in the value of B will
be reflected immediately in changes in
the likelihood of the A-R1 sequence
without any further A-R1-B sequences
required to mediate this change in like-
lihood; but (b) learning will rarely be
demonstrated in an A-R1-B sequence
where B has no history as a reinforcer
or a secondary reinforcer, or where B
is completely irrelevant to a strong pres-
ent motivation, because in these cases
B will not be perceived. In the terms
of our model, b will not fire.


STIMULUS CONTROL OF IDEAS 


The objection has long been made to 
cognitive theories that they do not gen- 
uinely predict behavior because they are 
unable to specify clearly before the fact 
the conditions under which the so-called 
immanent or ideational determinants of 
behavior will operate. 

Our mechanical model for sign-gestalt 
theory takes a long step toward meeting 
this objection. The main cognitive de- 
terminants in Tolman’s system are per- 
ceptions, expectations, readinesses, and 
demands. Tolman groups the first two, 
but we separate them. Our model speci- 
fies stimulus conditions, or operations 
under the control of the experimenter 
for the control of each of these cognitive 
processes. 

Let us presume that our subject has 
been habituated to the sequence
A-R1-B-R2-C-R3-D. D is a primary goal,
and thus this is the paradigm for any
regularly repeated stimulus-response se-
quence eventuating in a goal. The in-
ternal organization resulting from the
habituation will be a-r1-b-r2-c-r3-d. To
arouse the “perception of A” we must
fulfill the conditions for the firing of a.
Stimulus A plus some conditioned stim-
ulus of d will suffice; for A is the un-
conditioned stimulus of a, and the re-
verberation of d assures the motivation
of a. At the same time, we have ful-
filled the conditions for the “expectation
of B,” that is, the reverberation of b.
This is because a’s firing provides a
conditioned stimulus for b and d’s re-
verberation provides adequate motiva-
tion; therefore b reverberates and B is
expected. Although the conditions for
the arousal of the “perception of A”
are identical with those for the “ex-
pectation of B,” the conditions for the
termination of these two states are dif-
ferent. Firing of a will cease upon with-
drawal of A; but reverberation of b will
tend to continue until the presentation
of B produces firing of b. Next, the
presentation of a conditioned stimulus 
for d combined with a conditioned stim- 
ulus for c will produce a “readiness for 
C.” This is because a conditioned stim- 
ulus combined with adequate motivation 
produces reverberation. The readiness 
will be terminated by presentation of C 
(which would fire c and thus terminate 
reverberation) or of D (which would 
cut off c’s supply of instrumental motive
force by terminating the reverberation
of d). Finally, it is quite obvious that
the presentation of a conditioned stimu- 
lus for d arouses a demand for D, and 
the presentation of D itself terminates 
that demand. 

An experimental program which
makes use of some of these specifica- 
tions will be outlined briefly in the next 
section. 


THE GROWTH OF APPROACH MOTIVES


In conclusion, I am going to suggest 
briefly an experimental program for the 
investigation of the growth and decline 
of secondary approach motives based 
on the variables derived from the new 
model. 

In the first place, it has been sug- 
gested that the intrinsic motive force 
of an assembly is a joint function of 
the number of “transmission units” in 
the assembly and the “motive force”
vested in each unit (Xa). The first
problem in growing a motive, therefore,
is to get some transmission units into 
the assembly, i.e., to get an assembly 
to start with. To do this we must give 
our subject some commerce with a stim- 
ulus, and then assure the firing of the 
newly formed assembly for some periods 
of time (Xb). Presume that we want
to form a motive directed at stimulus
B as a goal. We may form an assem-
bly and assure its firing by habituating
our subject to the stimulus-response se-
quence A-R1-B-R2-C in which C is a
primary goal. This forms the cell as-
sembly b. We know the conditions for
assuring the firing of b, namely, that
during the time intervals while B is pre-
sented, if c is reverberating, b will be
firing. During these periods of firing,
b will be recruiting transmission units
(Xb) but these units will be losing mo-
tive force (Xc). Thus, we are creating
a cell assembly but not a motive.

In the future, however, the growth of
positive motive force in b will be a joint
function of time intervals of reverbera-
tion of b (Xd) and the combined posi-
tive motive force of b during these in-
tervals (Xe), and the latter will be a
function of the positive motive force
in c, and the strength of the association
between b and c (VIIIb). To accom-
plish time intervals of reverberation in
b we have to stretch out the time inter-
val between A-R1 and the presentation
of B; that is, we have to give the con-
ditioned stimulus which arouses rever-
beration in b and then delay the uncon-
ditioned stimulus which terminates this
reverberation. Therefore, we delay the
presentation of B with reference to its
place in the habituation sequence. This
delay should increase the intrinsic mo-
tive force in b (Xd), and should result
in a measurable increase in the reward 
value of the stimulus B. Increases in 
the reward value of B can be measured 
by changes in the subject’s tendency to 
pursue this stimulus; I will not go into
specific measures at this point, but they
have been developed. 

To accomplish a high combined mo-
tive force in b during intervals of rever-
beration, we have to assure a strong as-
sociational relation between b and c,
and we have to make sure that c is re- 
verberating during the delay. To vary 
combined motive force, then, we can 
vary the primary goal C, or vary the 
amount of habituation which establishes 
the associational relation. 

In the future, the decline of positive 
motive force in b will be a similar joint
function of time intervals of firing of
b and the intrinsic motive force of b
during those intervals of firing. The 
specific variables here are quite obvious, 
and I will not detail them here. 
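
The growth-and-decline program of postulate X can be sketched as a toy simulation. Everything quantitative here is an assumption made for illustration: the function `simulate`, its rate constants, the particular functional forms chosen for Xe and Xf, and the choice to start a new assembly with no motive force per unit are not given in the text.

```python
def simulate(schedule, units=0.0, per_unit=0.0, instrumental=2.0,
             recruit=1.0, decay=0.1, growth=0.05, dt=1.0):
    """Step assembly b through a schedule of 'fire', 'reverberate', or 'rest'.

    Returns the intrinsic motive force after each step, taken here as
    units * per_unit (Xa). All constants are illustrative assumptions.
    """
    history = []
    for state in schedule:
        intrinsic = units * per_unit
        if state == "fire":
            units += recruit * dt                  # Xb: firing recruits units
            # Xc with Xf: per-unit force declines during firing, and the
            # decline is slower when intrinsic force is already high.
            per_unit -= decay * dt / (1.0 + intrinsic)
            per_unit = max(per_unit, 0.0)
        elif state == "reverberate":
            combined = intrinsic + instrumental    # VIIIa
            # Xd with Xe: per-unit force grows during reverberation, and
            # faster when the combined force is high.
            per_unit += growth * combined * dt
        history.append(units * per_unit)
    return history


habituation_only = simulate(["fire"] * 3)
with_delay = simulate(["fire"] * 3 + ["reverberate"] * 3)
```

In this sketch habituation alone builds an assembly of three units with no intrinsic motive force, while the delay steps (reverberation of b before B is presented) raise the intrinsic force on each step, which is the qualitative pattern the program above expects.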

Experiments to carry out this pro- 
gram have been designed and some com- 
pleted. Two experiments investigating 
motive force in b as a function of the
delay of B have shown that after habitu- 
ation this delay does produce significant 
motive growth (4). Experiments to 
test the effects of other variables are 
in progress. 


SUMMARY 


A mechanical model for sign-gestalt 
theory based on Hebb’s (1) discussion 
of the cell assembly has been outlined. 
The cell assembly is used as the struc- 
tural model for both “ideas” and 
“wants”; these two terms are rendered 
equivalent except that wants tend to 
have a higher motive force parameter 
than ideas. Cell assemblies have two 
kinds of activation, reverberation (cor- 
responding to “thought” or “motiva- 
tion”) and firing (corresponding to 
“perception” or “gratification”).

The model provides for the formation 
of associational relations among cell as- 
semblies when there is a succession of 
stimuli in the environment. For exam-
ple, if the objective stimulus-response
sequence is A-R1-B and so forth, where
A and B are stimuli, then an internal
associational relation will be formed
a-r1-b, where a and b are cell assem-
blies, and r1 a response control unit.
After an associational relation has thus
been formed between cell assemblies a
and b through the response control unit
r1, the firing of a will tend to arouse re-
verberation in b, and reverberation in b
(aroused from some other quarter) will
add to the motive force of a and a’s
further antecedents. Thus, the associ-
ational relation passes stimulation for-
ward from a to b and motivation back-
wards from b to a. Cell assemblies have
two thresholds, a stimulus threshold 
and a motive threshold; both must be 
crossed simultaneously before any sort 
of activation will occur. The stimulus 
threshold may be crossed by either a 
“conditioned stimulus” (i.e., a firing an- 
tecedent) or an “unconditioned stimu- 
lus”; with adequate motivation, the for- 
mer will produce reverberation, the 
latter will produce firing. The motive 
threshold must be crossed by the in- 
trinsic motive force of the cell assembly 
or by a reverberating successor. Action 
is elicited when the antecedent assembly 
of a response control unit is firing, and 
the successor of the same relation is re- 
verberating, and there is a motivational 
balance across the response favorable to 
the outcome. 

The position adopted here represents 
an expansion of the position presented 
by Hebb (1). Hebb conceives facili- 
tation as flowing both ways across an 
associational relation. However, he 
does not anywhere explicitly recognize 
the necessity that one particular kind 
of facilitation, namely, that which is 
here called motive force, can be con- 
ceived only as flowing from associational 
successor to associational antecedent if 
the problem of motivation is to be 
solved. I do not mean here that time 
in the central nervous system flows 
backwards. There is no hocus-pocus
or magic here. My argument is sim-
ply that when cell assemblies are estab-
lished in a communicating chain of cir- 
cuits by the succession of their stimuli 
in the environment, then motivational 
flow will be from the representor of the 
successor to the representor of the ante- 
cedent. 

The model is used to provide a reori- 
entation of the latent-learning contro- 
versy. Latent learning is predicted in 
the sense that a change in the value of 
an outcome will change the likelihood 
of its preceding responses without fur- 
ther repetitions of the responses to me- 
diate this change of likelihood. But the 
model fails to predict learning without 
reinforcement, for a stimulus must have 
value to be perceived (a cell assembly 
must have motivation in order to fire). 
On this basis, a change of focus in 
latent-learning experiments is suggested. 

The model is used further to provide 
a new basis for research on the question 
of the functional autonomy of motives. 
Full-fledged learned drives are pre- 
dicted, and the variables in their growth 
and decline are suggested. In general, 
it is suggested that the firing of an as- 
sembly increases the number of trans- 
mission units in the assembly, but de- 
creases the motive force allocated to 
each transmission unit. Thus, it in- 
creases the size of the potential moti- 
vating unit, but decreases its motive 
force. Motive force, however, will grow 
later as a joint direct function of time 
intervals of reverberation, and instru- 
mental value during those time inter- 
vals, and the size of the reverberating 
cell assembly. Firing will later tend to 
extinguish the motive force of an as- 
sembly. 

The implication is that after habitu- 
ation of a subject to a stimulus-response 
sequence such as A-R_1-B-R_2-C where
A, B, C are stimuli, C being a primary
reward, then the lengthening of the
R_1-B time interval will tend to produce
increments in the intrinsic reward value
of the stimulus B, and lengthening of 
the time interval of presentation of B 
will tend to produce decrements in this 
intrinsic value. Experiments validating 
the first half of this generalization have 
been performed (4); others are in prog- 
ress. 
REFERENCES 
1. Hebb, D. O. The organization of behavior.
New York: Wiley, 1949.

2. Hull, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

3. Hull, C. L. Behavior postulates and
corollaries—1949. Psychol. Rev., 1950,
57, 173-180.

4. Olds, J. The influence of practice on the
strength of secondary approach drives.
J. exp. Psychol., 1953, 46, 232-236.

5. Thistlethwaite, D. A critical review of
latent learning and related experiments.
Psychol. Bull., 1951, 48, 97-129.

6. Tolman, E. C. Purposive behavior in ani-
mals and men. New York: Appleton-
Century, 1932.

7. Tolman, E. C., & Gleitman, H. Studies
in learning and motivation: I. Equal
reinforcements in both end-boxes, fol-
lowed by shock in one end-box. J.
exp. Psychol., 1949, 39, 810-819.


(Received April 8, 1953) 








Psychological Review 
Vol. 61, No. 1, 1954 


THE PLACE OF PHYSIOLOGICAL CONSTRUCTS IN A 
GENETIC EXPLANATORY SYSTEM ¹


GUDMUND SMITH 


University of Lund, Sweden 


There are various ways of explaining 
behavior events physiologically. Let us 
distinguish here between (a) the use of 
physiological data, or of constructs de-
rived from such data, and (b) the use
of constructs which need not necessarily
be verified under the microscope or in
the EEG. Some of the more advanced
psychological theories are based on 
hypothetical constructs, as, for exam- 
ple, Hebb’s theory (4) and Klein and 
Krech’s “conductivity” concept (5, 7). 
Such brain models seem to serve as sub- 
stitutes for psychologically defined mod- 
els partly because their units of analysis 
are easy to conceptualize, to handle.
The present paper is, however, primarily
concerned with the first, less
sophisticated and more common kind
of physiological theorizing in psychology
indicating that physiological processes
are the manifest reality underlying all
behavior events. This approach has
often been criticized, and we need not 
repeat the criticism here (8, 11, 14). 
Instead, the belief that physiological 
data represent the basis and origin of 
mental processes will be used here as a 
convenient starting point for further in- 
quiry into the place and role of physio- 
logical constructs in psychology, espe- 
cially in a genetic frame of reference. 

1 The author wishes to express his gratitude
to Drs. George S. Klein and Daniel J. Levin-
son for valuable criticism.

The assumption that physiological
facts represent a “basic level” in the
individual, the last link in the explana-
tion of mental processes, is part of a
more general assumption that behavior
data have to be referred directly to
physical objects, inside or outside, in
order to be understood at all. As sug- 
gested already by Natorp and Cassirer, 
however, psychology need not adopt this 
traditional method of physics and physi- 
ology but can (and should) adopt a 
method of its own. The aim of this 
method is not to make new constructs 
in the same objectivizing direction as 
the natural sciences, but to reconstruct 
physical objects by tracing them back 
to their origin, the experiencing sub- 
ject. Instead of using hypostatized 
constructs, such as body structure and 
outside objects, as an explanatory ba- 
sis for mental processes, the psy- 
chologist should analyze the constructs 
themselves with respect to their gene- 
sis in mental processes. Consequently, 
a physical-physiological unit might be 
regarded as the outcome of a more or 
less condensed series of behavior events 
(perception, concept formation, etc.), 
the early stages of which are the pre- 
requisite for the later, more adapted 
and objectivized ones. 

An explanatory model concerned with 
physical-physiological categories is a 
generalized, abstract conception of re- 
ality, in many respects the end prod- 
uct of the conceptual development of 
Western science. Similarly, a physi- 
calistic (“reality-oriented”) frame of 
reference accepted by the individual 
can be described as the result of a far- 
reaching emotional-intellectual sociali- 
zation. Piaget and Rapaport, among 
many others, follow in detail this de- 
velopment from primary to secondary 
stages in our cognitive schemata and 
thought processes, this acceptance by 
degrees of a common, objective knowl- 





74 GUDMUND SMITH 


edge, of detours in thinking (9, 10, 12). 
Let us, therefore, understand physio- 
logical constructs or facts as signs of a 
more or less objectivizing (physicaliz- 
ing) set or point of view in human be- 
ings; let us regard their role as frames 
of reference for reality-testing in the 
individual’s development, his adaptation 
to a stabilized world. The proposition 
is, then, that the physiological “reality” 
determines behavior, not merely as a 
number of causal factors behind the 
“mental surface” but as a conception 
in the individual himself of human na- 
ture, of reality. 

Emmert’s law, stating that the apparent
size of an afterimage varies directly
with the subject’s distance from the
projection field, may serve as an
illustration (13). According to the
proposition, it can be assumed that these
size relations hold true when the
experienced world of an individual (his
relevant region) is conceptualized in a
physicalistic, “accurate” way. This
implies that
afterimage and screen must become iso- 
lated from each other, the afterimage as 
a “subjective” and the screen as an 
“objective” phenomenon. The after- 
image, conceived of in this conventional, 
physicalistic way, is a constant nerve 
process; the screen is a nerve process 
changing in inverse proportion to the 
screen’s distance from the eye. Natu- 
rally, the subject need not know any- 
thing about retinal areas and the like, 
only the formal differences between the 
stable (inside) and the changing (out- 
side) reference systems. As pointed out 
in an earlier discussion on Emmert’s 
law, the arrangements in most after-
image experiments of this type favor
an isolation of image and screen, favor 
the analytic set necessary to diminish 
size constancy as far as possible (13). 

This being true, the relations in the 
world we perceive must become equivalent
with the relations in the physiological
schema. When now the screen
is moved to or from the subject the
area of stimulated nervous tissue will 
be extended or diminished—in linear 
proportion to the distance—but the 
afterimage (as excited area) remains 
constant. Hence, the afterimage will 
be small or large, respectively, as com- 
pared with the excited area of the pro- 
jection screen. And the same relations 
appear for the perceived world of our 
“objective” subjects; their afterimages 
conform to Emmert’s law. But as soon 
as the conceptual schema is less devel- 
oped, or, as soon as it is different, there 
will be deviations from the rule. Chil- 
dren, for instance, often think that the 
afterimage is a real object like the pro-
jection screen, i.e., an object the size
of which varies in the same way as 
other external objects. Consequently, 
their afterimages do not increase or de- 
crease in relation to the screen at vari- 
ous distances but are apparently size- 
constant (13). In many adults, too, 
a negative afterimage (or an eidetic 
image) is first considered to be an 
object “out there”; not until late in 
a series of experiments is the size con- 
stancy overcome. 
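
The size relations that Emmert's law describes for the "objective" subject can be put in numbers. The sketch below assumes the conventional geometric reading, in which the afterimage subtends a fixed visual angle, so that the extent it covers on the projection screen grows in direct proportion to the screen's distance. The function names and the example values are hypothetical and serve only as an illustration.

```python
import math


def afterimage_extent(distance, visual_angle_deg):
    """Linear extent covered on the screen by an afterimage of fixed visual angle."""
    return 2.0 * distance * math.tan(math.radians(visual_angle_deg) / 2.0)


def fraction_of_screen(distance, visual_angle_deg, screen_width):
    """Share of a screen of fixed physical width that the afterimage appears to cover."""
    return afterimage_extent(distance, visual_angle_deg) / screen_width


# Doubling the distance doubles the apparent extent (Emmert's law).
near = afterimage_extent(1.0, 5.0)
far = afterimage_extent(2.0, 5.0)
```

A subject who treats the afterimage as an external object of constant size, as the children described above do, would not show this proportional change.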

Thus, the variations in apparent size 
of projected afterimages differ among 
people because the conceptual frames 
of reference adopted by them are dif- 
ferent. While a physiologist would 
probably prefer to say that the after- 
image follows Emmert’s law because 
of an underlying, constant nerve-process 
(which might be unstable in children 
and some adults),² the more reasonable
explanation, considering the deviations 
reported above, seems to be that the 
individual, for some reason or other, has 
adopted a conceptual schema in full
agreement with a world of linear physi- 
cal relationships. It is now easy to see 


2 The more advanced physiologizing psy-
chologist would, of course, use a hypothetical
variable to explain deviations in the law. 
This kind of theorizing will be discussed later.

why many a theory bound to manifest 
physical structures or observed physio- 
logical processes succeeds in explaining 
only specific and limited forms of be- 
havior; and one understands why some 
physiological psychologists are eager to 
make “pure perception” the main object 
of a psychological science. The classi- 
cal experimental psychology has some- 
times been able to explain response only 
because the (conventional) physicalistic 
view is generally accepted in our society. 
Percepts can be considered as represen- 
tations of behavior events within a more 
or less normalized framework of exter- 
nal reality, and, therefore, they must 
partly agree with a popular, physical- 
istic model of the world. 

The assumption that people perceive 
(behave) according to conceptual pat- 
terns as developed in their life history 
is not new; indeed, it has been stressed 
by Jackson, Head, Gelb, Stern, Cassirer, 
and many of their contemporaries, and 
later by students of perception and 
personality (6). Studies of cultural 
factors in perception, as, for example, 
comparisons of Rorschach responses in 
Western communities and primitive 
tribes (2), also tend to support the 
assumption; religion, customs, preju- 
dices, the whole reality imposed on 
us by our society seems to determine 
what we actually perceive.³ Cases of
brain-injured people perhaps illustrate 
the point most clearly. One of Gelb 
and Goldstein’s subjects, for example, 
did not see a red color as red in gen- 
eral but only as a specific hue related 
to well-known objects (e.g., straw- 
berry), because his approach was non- 
symbolic, because his conceptual sche- 
mata lacked centers for a categorized 
perception of color. His vision in a 
narrow sense was not impaired, how- 
ever; he was supposed still to have
receptors for red “in general.” But the
subject himself could not accept this
abstraction any more (3). Psychosomatic
medicine can furnish us with further
data, e.g., the acceptance of a somatic
cause of mental troubles may result in
somatic symptoms.

3 The developments in this field have been
excellently summarized and commented on by
Dennis (1).

Before concluding this discussion let 
us develop the considerations once 
again, but now in terms familiar to 
the traditional psychologist. It is not 
necessary to avoid the stimulus-response 
model altogether in order to show why 
physical facts often fail to explain 
experience and behavior data. Stim-
uli (from outside) and physiological
processes have been defined above as 
objective, generalized conceptions of 
reality as developed in the empirical 
tradition of natural science. In the
stimulus-response model, behavior is
influenced by external stimulation as
well as internal (body physiology). But
the response is not necessarily directly
determined by this stimulation and
coherent with its properties; it is, instead,
an expression of how the stimulation 
has been received and “acknowledged.” 
A behavior event will become the im- 
mediate reflection of a physical-physio- 
logical process only if this process is 
conceived of as “reality” by the sub- 
ject. As soon as the individual’s con- 
ception of reality is less “objective,” 
less socialized, the response cannot and 
will not be a mere prolongation of stim- 
ulus (inside or outside). The response 
or behavior, defined as the outward ex- 
pression of our experienced world or 
relevant region, has an immediate physi- 
cal-physiological basis only when this 
world is a stimulus reality. 

The physiological “level” does not 
represent the origin of a mental devel- 
opment but a stage in it, often (but 
not necessarily) the end result of the 
socialization process of thinking (cf. 
4). It seems to be meaningless to ask 
for physiological facts underlying be-
havior phenomena of an individual with- 
out knowing whether or not he has ac- 
cepted the generalized cognitive schema 
to which these facts belong. This might 
explain why neurological models as de- 
scribed in the introduction had to be 
extended over the traditional boundaries 
of a matter-of-fact science in order to 
cover more than limited forms of behav- 
ior. If, for instance, an individual per- 
sists in behaving abnormally in spite 
of the fact that all known neurological 
functions in him seem to be normal, if 
he refuses to adopt the neurologist’s re- 
ality and is solely governed by his own 
“unsocialized” experience, it becomes 
necessary to introduce a hypothetical 
construct (e.g., integration of brain 
processes), the derivations of which 
should be able to explain all behavior 
deviations, even those without a known 
physical basis or with an imagined one. 
This means that neurological constructs 
in psychology must be more concerned
with the reality represented by the wide
developmental range of human experi-
ence (behavior) than with the limited
reality of manifest physical facts and
physiological observations, i.e., these
hypothetical constructs must remain ba- 
sically psychological in spite of the 
physiological language. 

The empirical question is, however, 
how the generalized behavior has de- 
veloped in different individuals, or why 
it has developed in some individuals but 
not in others. A physicalistic schema as 
accepted by the individual thus gets a 
personal significance; it may, for in- 
stance, be looked upon as a communica- 
tion or defense mechanism, as a cogni- 
tive style, etc. (17). The physiological 
conception of the world, the impersonal 
behavior, like all behavior phenomena, 
ought to be genetically explained (16). 


REFERENCES 


1. Dennis, W. Cultural and developmental
factors in perception. In R. R. Blake
& G. V. Ramsey (Eds.), Perception:
an approach to personality. New York:
Ronald, 1951. Pp. 148-169.

2. Du Bois, Cora. The people of Alor.
Minneapolis: Univer. of Minnesota
Press, 1944.

3. Gelb, A., & Goldstein, K. Psychologische
Analysen hirnpathologischer Fälle. X:
Ueber Farbennamenamnesie. Psychol.
Forsch., 1924, 6, 127-186.

4. Hebb, D. The organization of behavior.
New York: Wiley, 1949.

5. Kessen, W., & Kimble, G. A. “Dynamic
systems” and theory construction. Psy-
chol. Rev., 1952, 59, 263-267.

6. Klein, G. S., & Krech, D. The problem
of personality and its theory. J. Pers.,
1951, 20, 2-23.

7. Klein, G. S., & Krech, D. Cortical con-
ductivity in the brain-injured. J. Pers.,
1952, 21, 118-148.

8. Lewin, K. A dynamic theory of person-
ality. New York: McGraw-Hill, 1935.

9. Piaget, J. Principal factors determining
intellectual evolution from childhood to
adult life. In Factors determining hu-
man behavior, Harvard Tercentenary
Publ. Cambridge: Harvard Univer.
Press, 1937. Pp. 32-48.

10. Piaget, J. La naissance de l’intelligence
chez l’enfant. Neuchâtel: Delachaux &
Niestlé, 1948.

11. Pratt, C. C. The logic of modern psy-
chology. New York: Macmillan, 1939.

12. Rapaport, D. Toward a theory of think-
ing. In D. Rapaport (Ed.), Organiza-
tion and pathology of thought. Se-
lected sources. New York: Columbia
Univer. Press, 1951. Pp. 689-730.

13. Smith, G. Psychological studies in twin
differences. With reference to after-
image and eidetic phenomena as well as
more general personality characteristics.
Lund, Sweden: Gleerup, 1949.

14. Smith, G. Interpretations of behavior se-
quences. With respect to a radical
change in the objective situation. Lund,
Sweden: Gleerup, 1952.

15. Smith, G. Sprache und Erlebnis. Theoria,
1952, 18, No. 1, 78-86.

16. Smith, G. Development as a psychologi-
cal reference system. Psychol. Rev.,
1952, 59, 363-369.

17. Smith, G., & Klein, G. S. Cognitive con-
trols in serial behavior patterns. J.
Pers., in press.


(Received April 8, 1953) 





A NOTE ON STIMULUS INTENSITY DYNAMISM (V)


FRANK A. LOGAN 


Institute of Human Relations, Yale University ¹


In the recent version of his theory 
(4), Hull postulates as an intervening 
variable, stimulus intensity dynamism 
(V), which is defined as a function of 
the intensity of the stimulus and 
which enters multiplicatively into 
the determination of excitatory poten-
tial. The choice of a theoretical as-
sumption is, of course, the right of the
theorist so long as useful predictions 
follow. However, this paper will 
attempt to show how Hull might have 
deduced the relevant empirical phe- 
nomena from his theory without the 
use of V. 

There are four general data areas 
for which V was especially designed. 
Let us summarize these and then pro- 
pose an alternative description. 

The first area is the classical condi- 
tioning situation where, for example, 
an increase in illumination is followed 
by a UCS. If two groups of subjects 
are exposed to this situation, where 
all known relevant variables are iden-
tical with the exception of the inten-
sity of the CS (i.e., the amount of in-
crease in brightness) the probability
of the CR is greater for the group with
the more intense CS (e.g., 5).² Hull
would deduce this result on the basis 
of the difference in V between the two 
groups. 

1 The writer is indebted to Drs. Mark A.
May, Neal E. Miller, and Burton S. Rosner
for a preliminary reading of the manuscript.
A research project designed to test quantita-
tively several of the differential predictions
herein contained is being supported by a
grant from the National Science Foundation.

2 This generalization is not unequivocally
supported (e.g., 2). Kessen (5) has suggested
a possible analysis of the conflicting results.

Let us, however, recognize that, between
trials, the subject is in the contextual
environment (S_c) containing a dimly
illuminated disk, and that any occurrence
of the response to S_c is not reinforced
by the UCS. When the organism is in the
more brightly illuminated environment
(S_1), the occurrence of the response is
repeatedly rewarded. The situation
becomes a discrimination problem in which
reinforcement follows the response to S_1
but not to S_c. For a second group of
subjects, also nonreinforced for
responding to S_c, the rewarded stimulus
complex is a still more brightly
illuminated environment (S_2). It follows
that, since the difference between S_c
and S_2 will be greater than the
difference between S_c and S_1 (assuming
that similarity is a monotonic function
of stimulus intensity), there will be
greater generalization of the inhibition
conditioned at S_c to S_1 than to S_2.
Hull has provided the derivation that the
net discriminatory excitatory tendency
(sEr) will be greater at the positive
stimulus the greater the difference
between the two stimuli. Therefore sEr
will be greater for the group with S_2 as
the CS than for the group with S_1; a
greater probability of CR is expected at
the stronger stimulus.
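
The discrimination argument above can be illustrated numerically. The sketch below is not Hull's equation system: it assumes, for illustration only, an exponential generalization gradient over j.n.d. distance and a simple subtraction of generalized inhibition from habit strength. The function names, the rate constant, and the habit and inhibition magnitudes are all hypothetical.

```python
import math


def generalized(amount, distance_jnd, rate=0.1):
    """Portion of `amount` generalizing over `distance_jnd` (assumed exponential gradient)."""
    return amount * math.exp(-rate * abs(distance_jnd))


def net_tendency(cs, context, habit=10.0, inhibition=6.0, rate=0.1):
    """Habit at the reinforced CS minus inhibition generalized from the nonreinforced context."""
    return habit - generalized(inhibition, cs - context, rate)


# CS as an increase from a dim intertrial context (intensities in j.n.d. units).
weak_cs = net_tendency(cs=10, context=0)
strong_cs = net_tendency(cs=20, context=0)

# CS as a decrease from a bright intertrial context: the more intense CS
# is now the one closer to the nonreinforced context.
strong_cs_near_context = net_tendency(cs=20, context=30)
weak_cs_far_from_context = net_tendency(cs=10, context=30)
```

Under these assumptions the stronger CS gives the larger net tendency only when the intertrial context lies at the low end of the intensity continuum; stimuli equally distant from an intermediate context give equal values.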

In this derivation, we assumed that 
the CS represented an increase in the 
intensity from a zero or minimal value. 
If, however, the CS were a decrease 
(so that, between trials, the illumina- 
tion would be brighter than any 
stimulus value used), the analysis 
here presented would lead to the ex- 
pectation that a group with the more 
intense CS (but a smaller change from 
the intertrial situation) would per-
form more poorly than a second group
with a weaker CS. That is, the non-
reinforced S_c in the derivation would
be at the upper end of the intensity 
continuum, and inhibition would gen- 
eralize down toward the less intense 
values. It would also be possible to 
have the stimulus at some intermedi- 
ate value between trials, and for one 
group to have a lower intensity serve 
as the CS, and for another group, a 
higher intensity. Stimulus intensi- 
ties appropriately chosen as equal 
j.n.d. distances away from the inter- 
mediate stimulus should give the same 
probability of CR even though one is 
more intense than the other. The 
postulates containing V would be 
forced to deduce that the difference 
between the groups favoring the more 
intense CS would still obtain even 
under these diverse conditions. 

The second set of data for which
Hull has found it expedient to employ 
V concerns those experiments dealing 
with the time interval between the 
CS and the UCS. These data suggest
that optimal conditioning will obtain 
at some asynchronism around one- 
half second, and that intervals either 
longer or shorter are less effective 
(e.g., 6). For this reason, Hull's 
system postulates a stimulus trace 
which changes as a function of time, 
and for which a molar stimulus equiv- 
alent is calculated for substitution in 
the equation for V. The trace rep- 
resents a changing dynamism, and the 
level of conditioning is assumed to 
depend upon the V occasioned by a 
trace of the appropriate age. 

The present position would suggest a
somewhat different interpretation of the
stimulus trace: the numerical value of
the trace describes the degree to which
a trace of that age represents a change
from the conditions of stimulation prior
to the onset of the stimulus. The trace
function thus states that the onset of a
stimulus produces a continuous change in
the stimulus complex, rising rapidly to a
maximal difference at about one-half
second, and thereafter being reduced
until the stimulus complex is,
effectively, as it was prior to
stimulation. The discrimination learning
paradigm again argues that the degree of
conditioning will be directly related to
the difference between S_c and the CS,
where this difference is partially a
function of the time since the onset of
the CS.³
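
The reinterpreted trace can be pictured as a curve of "difference from the pre-stimulus conditions" over time. The sketch below is a purely illustrative stand-in, not Hull's trace postulate: it assumes a single-peaked function scaled to reach its maximum of 1.0 at one-half second and to fall back toward zero afterwards. The function name and the functional form are hypothetical.

```python
import math


def trace_difference(t, peak_time=0.5):
    """Assumed difference between the current stimulus complex and the
    pre-stimulus complex, t seconds after CS onset (maximum of 1.0 at peak_time)."""
    if t <= 0:
        return 0.0
    x = t / peak_time
    return x * math.exp(1.0 - x)


# Difference at several CS-UCS intervals (seconds).
intervals = [0.1, 0.25, 0.5, 1.0, 2.0]
differences = [trace_difference(t) for t in intervals]
```

On this reading, the CS-UCS interval matters because it fixes how different the stimulus complex is from the intertrial context at the moment of reinforcement.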

The third general class of empirical
phenomena for which V is directly
applicable refers to primary stimulus
generalization along intensity as a
continuum. Let S_c, S_1, and S_2 be as
above, and choose another stimulus
intensity (S_3) which is (a) more intense
than S_2, and (b) equal j.n.d. steps away
from S_2 as is S_1. After the CR is
established to S_2, generalized response
tendency is obtained at S_1 and S_3.
Assuming that equal j.n.d. separation
means equal difference, habit
generalization from S_2 should be the
same to each of the two test stimuli.
However, it is found that the response
strength is greater at S_3 than at S_1
(e.g., 3) which Hull would deduce on the
basis of the difference in V.

3 Hull’s postulate of the stimulus trace (s)
does not contain the intensity (S) of the
stimulus, but only time (t) since its onset.
The most useful interpretation is that s is a
calculational device so that V = f(S, t). The
analysis offered in this paper would favor the
postulate s = f(S, t, S_c).

If the same conceptualization as
presented above is followed, the original
learning involves a discrimination
between S_c and S_2; the inhibition at
the former will generalize not only to
S_2 but to other stimulus intensities.
Since we have assumed that similarity is
a monotonic function of intensity, S_1
will be more similar to S_c than will
S_3; S_1 will therefore receive greater
generalized inhibition from S_c than
will S_3. Thus, although S_1 and S_3
will each receive equal generalized
habit from S_2, sEr will be greater at
S_3 than at S_1 because there will be
less generalized inhibition opposing it;
a greater probability of CR is therefore
expected at the stronger generalized
stimulus.
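
The generalization argument can be checked with a small numerical sketch. It assumes, for illustration only, the same exponential gradient for habit and for inhibition and a simple subtraction of the two; the function names and magnitudes are hypothetical and do not reproduce Hull's equations.

```python
import math


def gradient(amount, distance_jnd, rate=0.1):
    """Portion of `amount` generalizing over `distance_jnd` (assumed exponential gradient)."""
    return amount * math.exp(-rate * abs(distance_jnd))


def strength_at(test, trained, context, habit=10.0, inhibition=6.0):
    """Habit generalized from the trained CS minus inhibition generalized from the context."""
    return gradient(habit, test - trained) - gradient(inhibition, test - context)


# Intensities in j.n.d. units: trained CS at 20, test stimuli at 10 and 30.
weaker_dim_context = strength_at(test=10, trained=20, context=0)
stronger_dim_context = strength_at(test=30, trained=20, context=0)

# Same test stimuli when the intertrial context is intense.
weaker_bright_context = strength_at(test=10, trained=20, context=40)
stronger_bright_context = strength_at(test=30, trained=20, context=40)
```

With a dim context the stronger test stimulus has the advantage; with an intense context the advantage passes to the weaker one, which is the opposite of what a dynamism factor tied to absolute intensity would imply.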

According to the derivation given 
here, if the CS were a decrease in in-
tensity (a strong intertrial stimulus),
then greater generalized response 
strength should occur to a stimulus 
of weaker intensity than to a stimulus 
equally different from the original CS 
but stronger. Here, the implication 
of V is diametrically opposite. 

The fourth and final general class of
phenomena which involves the use of V
occurs in a simple discrimination between
S_1 and S_2 (used as above) obtained by
the single presentation method. An
organism is placed in a starting box and,
shortly thereafter, a guillotine door is
raised exposing a hinged door of either
light or dark gray. This changes the
stimulus complex into either S_1 or S_2,
in only one of which locomotion through
the door is rewarded. The response
strength to the positive stimulus is
found to be greater following the
discrimination training if the more
intense of the pair has been the positive
stimulus (e.g., 1). Hull has derived
this result on the basis of the greater
V at the more intense stimulus.

If, however, S_c is also considered, it
will be immediately seen that, when S_1
is the reinforced stimulus complex, both
S_c and S_2 will be accruing inhibition
which will generalize upon S_1 from both
sides. When, however, S_2 is the
positive stimulus, it will receive the
same amount of inhibition generalized
from S_1 as, in the reverse case, S_1
received from S_2; but the generalized
inhibition from S_c will be less to S_2
as the positive stimulus than was the
case to S_1 as the positive stimulus.
Since S_2 would therefore receive the
less total generalized inhibition were it
the positive stimulus, sEr would be
greater when S_2 is positive than when
S_1 is positive.
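
The single-presentation case adds a second source of inhibition, and the bookkeeping is easy to follow in a short sketch. As an illustration only, it assumes an exponential generalization gradient and additive inhibition from the negative stimulus and from the intertrial context; the names and magnitudes are hypothetical.

```python
import math


def gradient(amount, distance_jnd, rate=0.1):
    """Portion of `amount` generalizing over `distance_jnd` (assumed exponential gradient)."""
    return amount * math.exp(-rate * abs(distance_jnd))


def positive_strength(positive, negative, context, habit=10.0,
                      negative_inhibition=6.0, context_inhibition=6.0):
    """Habit at the positive stimulus minus inhibition generalized from
    the negative stimulus and from the intertrial context."""
    total_inhibition = (gradient(negative_inhibition, positive - negative)
                        + gradient(context_inhibition, positive - context))
    return habit - total_inhibition


# Dim context at 0, weaker stimulus at 10, stronger stimulus at 20 (j.n.d. units).
weaker_positive = positive_strength(positive=10, negative=20, context=0)
stronger_positive = positive_strength(positive=20, negative=10, context=0)

# No inhibition accrued to the context (subject never meets it alone).
weaker_no_context = positive_strength(10, 20, 0, context_inhibition=0.0)
stronger_no_context = positive_strength(20, 10, 0, context_inhibition=0.0)
```

The advantage of the more intense positive stimulus comes entirely from the context term in this sketch; when no inhibition is attached to the context, the two arrangements give the same value.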

It should be possible to employ 
single presentation discrimination
learning, but to insure that the subject 
never experiences the contextual en- 
vironment of the stimulus except at 
times when either the positive or 
negative stimulus is present. This 
would preclude the development of
inhibition at S_c. Under such condi-
tions, the analysis followed here would
deduce that sEr would be identical 
whether the weaker or the more in- 
tense of the pair was the positive 
stimulus. 

The derivations followed above have 
been more substantiative than exact 
on the assumption that anyone famil- 
iar with the theory will have sufficient 
facility with its application to dis- 
crimination learning to follow the 
sketch presented. It will be immedi- 
ately apparent that this discrimina- 
tion analysis leads to similar deduc- 
tions as obtained by the use of V if 
three assumptions are fulfilled: (a) 
the subject is exposed to the contex- 
tual environment of the relevant 
stimulus, that (b) during such expo-
sure there is a zero or minimal inten-
sity of that relevant stimulus, and that 
(c) any performance of the response 
during these intertrial conditions is 
nonreinforced. Differential implica- 
tions have been suggested if these 
assumptions are not met. 

While the writer is not aware of
research bearing directly upon these
implications, several incidental findings
seem to favor the present analysis. A
number of experimenters have used the
offset of a tone as a CS, obtaining
satisfactory conditioning even though
dynamism would be near zero. Since V is
assumed to enter multiplicatively in
determining excitatory potential, it
would force sEr to zero and predict no
conditioning. Also it is common, though
typically unreported, experience to
observe the occurrence of the response
between trials more frequently early in
training than later. This would be
consistent with the hypothesis that the
response becomes extinguished to the
contextual intertrial stimulus conditions.

Subsequent experimentation⁴ may suggest,
of course, that both the above analyses
are necessary; that is, that there is an
effect determined by the absolute
intensity of the CS over and above the
effect of the difference between the CS
and the intertrial stimulus. More exact
research is required before an adequate
formulation can be stated.

4 Subsequent to the preparation of this
manuscript, Marvin Schwartz has obtained
unpublished data suggesting that a weaker CS
is more effective than a stronger one when the
contextual intertrial stimulus is intense, and
that the occurrence of the response between
trials becomes less frequent with practice.
The writer has also learned by personal com-
munication that Dr. Charles C. Perkins, Jr.
has independently obtained comparable re-
sults.


REFERENCES 


1. Antoinetti, J. A. The effect of discrim-
ination training upon generalization.
Unpublished manuscript, 1950.
(Quoted in Hull, C. L. A behavior
system. New Haven: Yale Univer.
Press, 1952.)

2. Grant, D. A., & Schneider, D. E. In-
tensity of the conditioned stimulus and
strength of conditioning: II. The con-
ditioned galvanic skin response to an
auditory stimulus. J. exp. Psychol.,
1949, 39, 35-40.

3. Hovland, C. I. The generalization of
conditioned responses. II. The sen-
sory generalization of conditioned re-
sponses with varying intensities of
tone. J. genet. Psychol., 1937, 51,
279-291.

4. Hull, C. L. A behavior system. New
Haven: Yale Univer. Press, 1952.

5. Kessen, W. Response strength as a func-
tion of conditioned stimulus intensity.
Unpublished doctor's dissertation, Yale
Univer., 1952.

6. Reynolds, B. The acquisition of a trace
conditioned response as a function of
the magnitude of the stimulus trace.
J. exp. Psychol., 1945, 35, 15-30.

(Received for early publication Septem- 
ber 10, 1953) 




















THE BRITISH JOURNAL 
OF PSYCHOLOGY 


Edited by D. W. HARDING
Vol. XLIV. Part 3 August 1953 12s. 6d. net.


OBITUARY NOTICE. David Katz. 

C. A. MACE. Homeostasis, needs and values. 

W. M. O’NEIL. Hypothetical terms and relations in psychological theorizing. 

W. KENNETH RICHMOND. Educational measurement: its scope and limita- 
tions. A critique. 

L. W. SHEARS. The dynamics of leadership in adolescent school groups. 

F. H. GEORGE. ‘Either-or’ questions in series. 

F. A. CHRENKO. Probit analysis of subjective reactions to thermal stimuli—
a study of radiant panel heating in buildings.

G. ROBERT GRICE. Hunter’s test of the absolute and relative theories of
transposition.

IAN M. L. HUNTER. Reply to Professor Grice. 

PUBLICATIONS RECENTLY RECEIVED. 








Vol. XLIV. Part 4 November 1953 12s. 6d. net.


K. R. L. HALL. Studies of cutaneous pain: a survey of research since 1940. 
D. E. BROADBENT. Noise, paced performance and vigilance tasks. 

J. A. DEUTSCH. A new type of behaviour theory. 

> MUNDY-CASTLE. Electrical responses of the brain in relation to be- 


MUKHTAR HAMZA. The dynamic forces in the personalities of juvenile de-
linquents in the Egyptian environment.

F. V. SMITH, W. SLUCKIN and D. GRAHAM. The efficiency of differently 
constituted groups of children in different types of tasks. 

A. H. D. TOZER and H. J. C. LARWOOD. An analysis of intelligence test 
scores of students in a university department of education. 

KATHLEEN P. WATTS. Influences affecting the results of a test of high-
grade intelligence.

CYRIL A. ROGERS. The structure of verbal fluency. 

GEORGE HUMPHREY. Five years in the Oxford Chair. 

PUBLICATIONS RECENTLY RECEIVED. 





The subscription price per volume, payable in advance, 
is 40s. net (post free). (U. S. $6.50). 





Subscriptions may be sent to any bookseller or to the 


CAMBRIDGE UNIVERSITY PRESS 
Bentley House, Euston Road, London, N. W. 1 





























